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The Hon. Commissioner of Patents 

And Trademarks, 
Washington, DC 20231 U.S.A. 

Dear Sir: 

PRELIMINARY AMENDMENT 

As a Preliminary Amendment, please amend this application as follows. 

IN THE DISCLOSURE 

Please insert the following paragraph after the title and before the heading 
"TECHNICAL FIELD" 

-CROSS-REFERENCE TO RELATED APPLICATIONS 

This application is the national phase of PCT international application Serial No. 
PCT/CAOO/00738 filed June 20, 2000, and claims the priority right of provisional 
application 60/139,788 filed June 21, 1999 and 60/182,905 filed February 16, 2000. 



IN THE CLAIMS 

Please cancel the claims as amended during the international period and substitute 
therefor the following claims: 
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1 . A method of increasing the triacylglyceride content of an organism which comprises 
expressing in the organism an introduced DNA encoding a protein having glycerol 

3 -phosphate acyltransferase (GPAT) activity. 

2. The method of claim 1, wherein the organism is a plant. 

3. The method of claim 2, wherein the plant is an oilseed bearing plant. 

4. The method of claim 2, wherein the plant is of the genus Brassica. 

5. The method of claim 2, wherein the plant is Arabidopsis thaliana. 

6. The method of claim 1, wherein the organism is a yeast. 

7. The method of claim 1, wherein the DNA encodes a protein comprising a sequence 
that differs from SEQ ID NO: 6 but has at least 70% sequence homology with 
SEQ ID NO: 6 and the same function as the protein of SEQ ID NO: 6. 

8. The method of claim 1, wherein the DNA comprises a sequence encoding a protein 
comprising SEQ ID NO: 6. 

9. The method of claim 1, wherein the DNA encodes a protein comprising a sequence 
that differs from SEQ ID NO: 9 but has at least 70% sequence homology with SEQ 
ID NO:9 and the same function as the protein of SEQ ID NO: 9. 

10. The method of claim 1, wherein the DNA comprises a sequence encoding a protein 
comprising SEQ ID NO: 9. 

11. The method of claim 1, wherein the DNA is a DNA having a sequence as recited in 
SEQ ID NO: 1, or a variant thereof having at least 70% sequence identity to SEQ ID 
NO:l. 

12. The method of claim 1, wherein the DNA is a DNA having a sequence as recited in 
SEQ ID NO: 2, or a variant thereof having at least 70% sequence identity to SEQ ID 
NO: 2. 

13. The method of claim 1, wherein the DNA is a DNA having a sequence as recited in 
SEQ ID NO: 3, or a variant thereof having at least 70% sequence identity to SEQ ID 
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NO: 3. 

14. The method of claim 1, wherein the DNA is a DNA having a sequence as recited in 
SEQ ID NO: 4, or a variant thereof having at least 70% sequence identity to SEQ ID 
NO: 4. 

15. The method of claim 1 , wherein the DNA is a DNA having a sequence as recited in 
SEQ ID NO: 5, or a variant thereof having at least 70% sequence identity to SEQ ID 
NO: 5. 

16. A method of increasing the triacylglyceride content of an organism by transforming 
the organism with a vector, wherein the vector comprises a DNA encoding a protein 
comprising SEQ ID NO: 6, or a protein having the same function comprising a 
sequence having at least 70% homology with SEQ ID NO: 6. 

17. A method of increasing the triacylglyceride content of an organism by transforming 
the organism with a vector, wherein the vector comprises a DNA encoding a protein 
comprising SEQ ID NO: 7, or a protein having the same function comprising a 
sequence having at least 70% homology with SEQ ID NO: 7. 

18. A method of increasing the triacylglyceride content of an organism by transforming 
the organism with a vector, wherein the vector comprises a DNA encoding a protein 
comprising SEQ ID NO: 8, or a protein having the same function comprising a 
sequence having at least 70% homology with SEQ ID NO: 8. 

1 9. A method of increasing the triacylglyceride content of an organism by transforming 
the organism with a vector, wherein the vector comprises a DNA encoding a protein 
comprising SEQ ID NO: 9, or a protein having the same function comprising a 
sequence having at least 70% homology with SEQ ID NO: 9. 

20. A method of increasing the triacylglyceride content of an organism by transforming 
the organism with a vector, wherein the vector comprises a DNA encoding a protein 
comprising SEQ ID NO: 10, or a protein having the same function comprising a 
sequence having at least 70% homology with SEQ ID NO: 10. 

21. A non-human organism transformed with a DNA, wherein the DNA encodes a protein 
having GPAT activity, and the organism, after transforming, has enhanced ability to 
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produce triacylglycerides (TAGs). 

22. The non-human organism of claim 21, wherein the organism is a plant. 

23. The non-human organism of claim 21, wherein the organism is an oilseed bearing 
plant. 

24. The non-human organism of claim 22, wherein the plant is a member of the genus 
Bras sic a. 

25. The non-human organism of claim 21, that is Arabidopsis thaliana. 

26. The non-human organism of claim 21, wherein the organism is a yeast. 

27. The non-human organism of claim 21, wherein the DNA encodes a protein 
comprising SEQ ID NO: 6, or a protein having the same function comprising a 
sequence having at least 70% sequence homology with SEQ ID NO: 6. 

28. The non-human organism of claim 2 1 , wherein the DNA encodes a protein 
comprising SEQ ID NO: 7, or a protein having the same function comprising a 
sequence having at least 70% sequence homology with SEQ ID NO: 7. 

29. The non-human organism of claim 21, wherein the DNA encodes a protein 
comprising SEQ ID NO: 8, or a protein having the same function comprising a 
sequence having at least 70% sequence homology with SEQ ID NO: 8. 

30. The non-human organism of claim 21, wherein the DNA encodes a protein 
comprising SEQ ID NO: 9, or a protein having the same function comprising a 
sequence having at least 70% sequence homology with SEQ ID NO: 9. 

3 1 . The non-human organism of claim 2 1 , wherein the DNA encodes a protein 
comprising SEQ ID NO: 10, or a protein having the same function comprising a 
sequence having at least 70% sequence homology with SEQ ID NO: 10. 

32. The non-human organism of claim 21 , wherein the DNA is a DNA having a sequence 
as recited in SEQ ID NO: 1, or a variant thereof having at least 70% sequence identity 
to SEQ ID NO: 1. 

33. The non-human organism of claim 21, wherein the DNA is a DNA having a sequence 
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as recited in SEQ ID NO: 2, or a variant thereof having at least 70% sequence identity 
to SEQ ID NO: 2. 

34. The non-human organism of claim 21, wherein the DNA is a DNA having a sequence 
as recited in SEQ ID NO: 3, or a variant thereof having at least 70% sequence identity 
to SEQ ID NO: 3. 

35. The non-human organism of claim 21, wherein the DNA is a DNA having a sequence 
as recited in SEQ ID NO: 4, or a variant thereof having at least 70% sequence identity 
to SEQ ID NO: 4. 

36. The non-human organism of claim 21, wherein the DNA is a DNA having a sequence 
as recited in SEQ ID NO: 5, or a variant thereof having at least 70% sequence identity 
to SEQ ID NO: 5. 

37. A vector for genetically transforming an organism, wherein the vector comprises a 
DNA encoding a protein having GPAT activity, and the organism, after transforming, 
exhibits enhanced production of triacylglycerides. 

38. The vector of claim 37, wherein the vector comprises DNA encoding a protein 
comprising SEQ ID NO: 6, or a protein having the same function comprising a 
sequence having at least 70% sequence homology with SEQ ID NO: 6. 

39. The vector of claim 37, wherein the vector comprises a DNA encoding a protein 
comprising SEQ ID NO: 7, or a protein having the same function comprising a 
sequence having at least 70% homology with SEQ ID NO: 7. 

40. The vector of claim 37, wherein the vector comprises a DNA encoding a protein 
comprising SEQ ID NO: 8, or a protein having the same function comprising a 
sequence having at least 70% homology with SEQ ID NO: 8. 

41. The vector of claim 37, wherein the vector comprises a DNA encoding a protein 
comprising SEQ ID NO: 9, or a protein having the same function comprising a 
sequence having at least 70% homology with SEQ ID NO: 9. 

42. The vector of claim 37, wherein the vector comprises a DNA encoding a protein 
comprising SEQ ID NO: 10, or a protein having the same function comprising a 
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sequence having at least 70% homology with SEQ ID NO: 10. 

43. A method of modifying the fatty acid composition of triacylglycerides produced by an 
organism, wherein the organism is transformed with a DNA encoding a protein 
having GPAT activity. 

A marked-up copy of the claims showing the amendments made is attached. 



REMARKS 



The purpose of this Preliminary Amendment is to insert a cross-reference to related 
applications on the first page of the description and to amend the claims to eliminate multiple 
dependencies and to employ terminology more appropriate for U.S. prosecution. 

No new subject matter has been added. 




Edwin J. Gale 
Reg. No. 28,584 
Tel (613) 237-6900 
Our File No. 44838 
December 20, 2001 




iii e o r:: ;; ,., o ml 

09/926 




531 RecWCT/r . 



MARKED-UP VERSION OF THE CLAIMS 



2. 



3. 



4. 



5. 



6. 



7. 



8. 



A method [for] of increasing the triacylglyceride content of an organism 
[characterised by] which comprises expressing in the organism an introduced 
DNA encoding a protein having glycerol 3 -phosphate acy transferase (GPAT) 
activity. 

[A] The method [according to] of claim 1, [characterised in that] wherein the 
organism is a plant. 

[A] The method [according to] of claim 2, [characterised in that] wherein the 
plant is an oilseed bearing plant. 

[A] The method [according to] of claim 2, [characterised in that] wherein the 
plant is of the genus Brassica. 

[A] The method [according to] of claim 2, [characterised in that] wherein the 
plant is Arabidopsis thaliana. 

[A] The method [according to] of claim 1 , [characterised in that] wherein the 
organism is a yeast. 

[A] The method [according to any one of claims 1 to 6, characterised in that] of 
claim 1, wherein the DNA encodes a protein comprising a sequence that differs 
from SEQ ID NO: 6 but has at least 70% sequence homology with SEQ ID NO: 6 
and the same function as the protein of SEQ ID NO: 6. 

[A] The method [according to any one of claims 1 to 6, characterised in that] of 
claim 1 , wherein the DNA comprises a sequence encoding a protein comprising 
SEQ ID NO: 6. 

[A] The method [according to any one of claims 1 to 6, characterised in that] of 
claim I, wherein the DNA encodes a protein comprising a sequence that differs 
from SEQ ID NO: 9 but has at least 70% sequence homology with SEQ ID NO: 9 
and the same function as the protein of SEQ ID NO: 9. 



[A] The method [according to any one of claims 1 to 6, characterised in that] of 
claim 1, wherein the DNA comprises a sequence encoding a protein comprising 
SEQ ID NO: 9. 

[A] The method [according to any one of claims 1 to 6, characterised in that] of 
claim K wherein the DNA is a DNA having a sequence as recited in SEQ ID 
NO: 1, or a variant thereof having at least 70% sequence identity to SEQ ID 
NO: 1. 

[A] The method [according to any one of claims 1 to 6, characterised in that] of 
claim K wherein the DNA is a DNA having a sequence as recited in SEQ ID 
NO: 2, or a or a variant thereof having at least 70% sequence identity to SEQ ID 
NO: 2. 

[A] The method [according to any one of claims 1 to 6, characterised in that] of 
claim 1 , wherein the DNA is a DNA having a sequence as recited in SEQ ID 
NO: 3, or a variant thereof having at least 70% sequence identity to SEQ ID 
NO: 3. 

[A] The method [according to any one of claims 1 to 6, characterised in that] of 
claim K wherein the DNA is a DNA having a [a] sequence as recited in SEQ ID 
NO: 4, or a variant thereof having at least 70% sequence identity to SEQ ID 
NO: 4. 

[A] The method [according to any one of claims 1 to 6, characterised in that] of 
claim 1. wherein the DNA is a DNA having a sequence as recited in SEQ ID 
NO: 5, or a variant thereof having at least 70% sequence identity to SEQ ID 
NO: 5. 

A method [for] of increasing the triacylglyceride content of an organism by 
transforming the organism with a vector, [characterised in that] wherein the 
vector comprises a DNA encoding a protein comprising SEQ ID NO: 6, or a 
protein having the same function comprising a sequence having at least 70% 
homology with SEQ ID NO: 6. 
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17. A method [for] of increasing the triacylglyceride content of an organism by 
transforming the organism with a vector, [characterised in that] wherein the 
vector comprises a DNA encoding a protein comprising SEQ ID NO: 7, or a 
protein having the same function comprising a sequence having at least 70% 
homology with SEQ ID NO: 7. 

18. A method [for] of increasing the triacylglyceride content of an organism by 
transforming the organism with a vector, [characterised in that] wherein the 
vector comprises a DNA encoding a protein comprising SEQ ID NO: 8, or a 
protein having the same function comprising a sequence having at least 70% 
homology with SEQ ID NO: 8. 

19. A method [for] of increasing the triacylglyceride content of an organism by 
transforming the organism with a vector, [characterised in that] wherein the 
vector comprises a DNA encoding a protein comprising SEQ ID NO: 9, or a 
protein having the same function comprising a sequence having at least 70% 
homology with SEQ ID NO: 9. 

20. A method [for] of increasing the triacylglyceride content of an organism by 
transforming the organism with a vector, [characterised in that] wherein the 
vector comprises a DNA encoding a protein comprising SEQ ID NO: 10, or a 
protein having the same function comprising a sequence having at least 70% 
homology with SEQ ID NO: 10. 

21. A non-human organism transformed with a DNA, [characterised in that] wherein 
the DNA encodes a protein having GPAT activity, and the organism, after 
transforming, has enhanced ability to produce triacylglycerides (TAGs). 

22. [An] The non-human organism [according to] of claim 2 1 , [characterised in that] 
wherein the organism is a plant. 

23. [A] The non-human organism [according to] of claim 21, [characterised in that] 
wherein the organism is an [oil seed] oilseed bearing plant. 
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24. [A] The non-human organism [according to] of claim 22, [characterised in that] 
wherein the plant is a member of the genus Brass ica. 

25. [A] The non-human organism [according to] of claim 21 , that is Arabidopsis 
thaliana. 

26. [A] The non-human organism [according to] of claim 21, [characterised in that] 
wherein the organism is a yeast. 

27. [A] The non-human organism [according to any one of claims 2 1 to 26] of claim 
2L [characterised in that] wherein the DNA encodes a protein comprising SEQ 
ID NO: 6, or a protein having the same function comprising a sequence having at 
least 70% sequence homology with SEQ ID NO: 6. 

28. [A] The non-human organism [according to any one of claims 2 1 to 26] of claim 
21 , [characterised in that] wherein the DNA encodes a protein comprising SEQ 
ID NO: 7, or a protein having the same function comprising a sequence having at 
least 70% sequence homology with SEQ ID NO: 7. 

29. [A] The non-human organism [according to any one of claims 2 1 to 26] of claim 
21, [characterised in that] wherein the DNA encodes a protein comprising SEQ 
ID NO: 8, or a protein having the same function comprising a sequence having at 
least 70% sequence homology with SEQ ID NO: 8. 

30. [A] The non-human organism [according to any one of claims 21 to 26] of claim 
21, [characterised in that] wherein the DNA encodes a protein comprising SEQ 
ID NO: 9, or a protein having the same function comprising a sequence having at 
least 70% sequence homology with SEQ ID NO: 9. 

31. [A] The non-human organism [according to any one of claims 21 to 26] of claim 
21, [characterised in that] wherein the DNA encodes a protein comprising SEQ 
ID NO: 10, or a protein having the same function comprising a sequence having 
at least 70% sequence homology with SEQ ID NO: 10. 

32. [A] The non-human organism of claim 21, [according to any one of claims 21 to 
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26] of claim 21 , [characterised in that] wherein the DNA is a DNA having a 
sequence as recited in SEQ ID NO: 1, or a variant thereof having at least 70% 
sequence identity to SEQ ID NO: 1. 

33. [A] The non-human organism [according to any one of claims 21 to 26] of claim 
21, [characterised in that] wherein the DNA is a DNA having a sequence as 
recited in SEQ ID NO: 2, or a variant thereof having at least 70% sequence 
identity to SEQ ID NO: 2. 

34. [A] The non-human organism [according to any one of claims 21 to 26] of claim 
21, [characterised in that] wherein the DNA is a DNA having a sequence as 
recited in SEQ ID NO: 3, or a variant thereof having at least 70% sequence 
identity to SEQ ID NO: 3. 

35. [A] The non-human organism [according to any one of claims 21 to 26] of claim 
2_L [characterised in that] wherein the DNA is a DNA having a sequence as 
recited in SEQ ID NO: 4, or a variant thereof having at least 70% sequence 
identity to SEQ ID NO: 4. 

36. [A] The non-human organism [according to any one of claims 21 to 26] of claim 
21, [characterised in that] wherein the DNA is a DNA having a sequence as 
recited in SEQ ID NO: 5, or a variant thereof having at least 70% sequence 
identity to SEQ ID NO: 5. 

37. A vector for genetically transforming an organism, [characterised in that] 
wherein the vector comprises a DNA encoding a protein having GPAT activity, 
and the organism, after transforming, exhibits enhanced production of 
triacylglycerides. 

38. [A] The vector [according to] of claim 37, [characterised in that] wherein the 
vector comprises DNA encoding a protein comprising SEQ ID NO: 6, or a protein 
having the same function comprising a sequence having at least 70% sequence 
homology with SEQ ID NO: 6. 
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39. [A] The vector [according to] of claim 37, [characterised in that] wherein the 
vector comprises a DNA encoding a protein comprising SEQ ID NO: 7, or a 
protein having the same function comprising a sequence having at least 70% 
homology with SEQ ID NO: 7. 

40. [A] The vector [according to] of claim 37, [characterised in that] wherein the 
vector comprises a DNA encoding a protein comprising SEQ ID NO: 8, or a 
protein having the same function comprising a sequence having at least 70% 
homology with SEQ ID NO: 8. 

41 . [A] The vector [according to] of claim 37, [characterised in that] wherein the 
vector comprises a DNA encoding a protein comprising SEQ ID NO: 9, or a 
protein having the same function comprising a sequence having at least 70% 
homology with SEQ ID NO: 9. 

42. [A] The vector [according to] of claim 37, [characterised in that] wherein the 
vector comprises a DNA encoding a protein comprising SEQ ID NO: 10, or a 
protein having the same function comprising a sequence having at least 70% 
homology with SEQ ID NO: 10. 

43. A method [for] of modifying the fatty acid composition of triacylglycerides 
produced by an organism, [characterised in that] wherein the organism is 
transformed with a DNA encoding a protein having GPAT activity. 
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OVEREXP RESS ION I N YEAST AND PLANTS OF A GEN E ENCODING 
GLYCEROL 3-PHOSPH ATE ACYLTRANSFERASE 



TECHNICAL FIELD 

The invention relates to the production of fats and oils for commercial and industrial 
uses. More particularly, the invention relates to a process by which natural oil or fat 
levels in organisms may be increased, fatty acid composition of triacylglycerides 
may be altered, and to nucleotide sequences which may be introduced into organisms 
to cause the increase, and plasmids, vectors, etc. useful in the process. 

BACKGROUND ART 

Non-animal-source oils are produced mainly for edible purposes but their use in non- 
edible applications is expected to increase due to declining fossil fuel supply 
1 (Kinney, 1998). More than 65 million metric tons of vegetable oils are produced 
currently with a total value of $US 25 billion 23 (Browse et al., 1998; Murphy, 1999). 
World demand for vegetable oils has increased by 300% since 1960. Further, the 
market share of animal-derived fats has declined from 39% of the total in 1960 to 
26% in 1990. All of these factors have contributed to a demand by industry for 
higher vegetable oil content in plant seeds in order to be cost effective during 
production and processing 4 (Bright and Hawkes, 1998). Biotechnology offers 
avenues for meeting this demand through identification and manipulation of the 
biochemical pathways that lead to oil production. 

Glycerol-3-phosphate acyltransferase (GPAT) catalyses the first reaction in 
triacylglyceride synthesis via the Kennedy pathway. It uses glycerol-3 -phosphate 
(G-3-P) and acyl-coenzyme A (acyl-CoA) thioesters to synthesise lysophosphatidic 
acid (LP A). In fat storing organs e.g., seeds and adipose tissues, the remaining 
reactions are catalysed by a /ysophosphatidic acid acyltransferase (LPAAT), 
phosphatidic acid phosphatase (PAPase) and diacylglycerol acyltransferase 
(DAGAT). Thus far, LPAATs and DAGATs have been the foci of studies on TAG 
biosynthesis. The role of GPAT has received less attention. Several studies have 
shown that PAPase controls the rate limiting step of TAG biosynthesis in mammals 



and yeast and DAGAT is the rate limiting enzyme in plants (Perry and Harwood, 
1994). 



Only a limited number of genes have been reported to enhance oil content (for 
example, Zou et al. 2 , 1997, Zou et al. 3 , 1998). There is a need to explore other 
5 approaches using other genes which might be more effective alone or in combination 
with previously cloned genes. 

Aplastidial GPAT gene has been used by others to change the fatty acid composition 
of membrane lipids and to improve chilling tolerance (Murata et al. 4 , 1992 and 
Nishizawa 5 , 1996). The bacterial GPAT gene (plsBJ has also been used to change 
10 the fatty acid composition of membrane lipids and to decrease chilling tolerance 

(Wolter et al. 6 , 1992). No study has yet used GPAT genes to increase oil content in 
an organism. 

In an article entitled "Engineering and breeding of new oil crops — Aclytransferases 
from basic science to modified seed oils", Fett/Lipid 100 (1998), Nr. 4-5, S. 161-166, 

15 Margrit Frentzen discloses that different discrete acyltransferases are involved in 
glycerlipid biosynthesis in plant cells. Specifically, chimeric l-acylglycerol-3- 
phosphate acyltransferase genes have been successfully utilized to achieve the 
synthesis of rapeseed oil with homogeneous fatty acid distributions desired for 
industrial applications. Moreover, it is mentioned that evidence has been provided 

20 that expression of acyltransferase genes in transgenic rapeseed plants can improve 
not only oil quality but also oil yield. 

PCT patent application WO 92/1 3082, which was filed by Kirin Beer KK and was 
published on August 6, 1992, discloses the genetic engineering of higher plants to 
confer chilling resistance. This involves producing a plant containing more than 
25 normal unsaturated fatty acids in membrane lipids. A preferred embodiment of such 
a plant is a transgenic plant expressing a polypeptide with a glycerol 3-phosphate 
acyltransferase activity that has a higher substrate selectivity for oleoyl-ACP than for 
palmitoyl-ACP. Specifically, an Atase gene from A. thaliana was expressed in 
transgenic tobacco plants. 
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DISCLOSURE OF THE INVENTION 

It is an object of the invention to provide a method for increasing the oil content of 
an organism. 

It is a further object of the invention to provide a genetically altered organism with 
enhanced GPAT activity, relative to the wild type. 

It is a further object of the invention to produce an organism having increased oil or 
fat production capability, relative to the wild type. 

Another object of the invention is to produce DNA clones, constructs and vectors 
suitable for modifying the genomes of organisms to increase the production of 
10 triacylglycerides (TAGs), relative to the wild type. 

An additional object of the invention is to produce an organism having an altered 
fatty acid composition in its triacylglycerides, relative to the wild type. 

Still a further object of the invention is to identify, isolate and clone a genetic 
element that may be used to modify the natural formation of triacylglycerols in plants 

15 




20 
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in order to increase the yield of commercial plant oils, or to modify their 
composition to achieve specific commercial improvements of plants and plant 
products. 

In a first aspect, the invention provides a method for increasing the oil content of an 
organism by inserting in the organism a DNA encoding a protein having glycerol 3- 
phosphate acyltransferase activity. 

In a second aspect, the invention provides an organism transformed with a DNA, 
wherein the DNA encodes a protein having GPAT activity, and the organism, after 
transforming, has enhanced ability to produce triacylglycerides. 

In a third aspect, the invention provides a vector for genetically transforming an 
organism, wherein the vector comprises a DNA encoding a protein having GPAT 
activity, and the organism, after transforming, exhibits enhanced production of 
triacylglycerides. 

In a fourth aspect, the invention provides a method for modifying the fatty acid 
composition of triacylglycerides produced by an organism/wherein the organism is 
transformed with a DNA encoding a protein having GPAT activity. 

All organisms have within their genome a gene encoding a protein having GPAT 
activity. The invention relates to a method for expressing in an organism at least one 
additional DNA sequence encoding a protein having GPAT activity. 

The inventors chose to target GPAT because of its role in the Kennedy pathway, a 
pathway that is common to all organisms. 

In a preferred embodiment, the invention pertains to micro-organisms. In a 
particularly preferred embodiment, to yeasts and plants. 

The method of the invention is particularly suited to the production of oil seed plants 
having enhanced TAG content, or having modified fatty acid composition in their 
TAGs. The terms "seed oil plant" and "oil seed crop" are meant to encompass any 
plant or crop from which the oil may be isolated in marketable quantity. Some plants 
or crops having TAGs with particularly interesting fatty acid composition are grown 
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for the production of TAGs, even though the lipid content is low (e.g. less than 1 
wt%). The method of the invention may be used in such plants to increase the 
content of TAG. Preferred plants or crops are those having a seed lipid content of at 
least 1 wt% (in the wildtype). Some illustrative examples of oil seed crops are as 
5 follows (trivial names are given in parentheses): 

Borago officinalis (Borage); Brassica species, for example mustards, canola, rape, B. 
campestris, B, napus, B. rapa; Cannabis sativa (Hemp, widely uses as a vegetable oil 
in Asia); Carthamus tinctorius (Safflower); Cocos nucifera (Coconut); Crambe 
abyssinica (Crambe); Cuphea species {Cuphea produce medium chain fatty acids of 

10 industrial interest); Elaeis guinensis (African oil palm); Elaeis oleifera (American oil 
palm); Glycine max (Soybean); Gossypium hiristum (Cotton - American); 
Gossypium barbadense (Cotton - Egyptian); Gossypium herbaceum (Cotton - 
Asiatic); Helianthus annus (Sunflower); Linum usitatissimum (Linseed or flax); 
Oenethera biennis (Evening primrose); Olea europea (Olive); Oryza sativa (Rice); 

15 Ricinus communis (Castor); Sesamum indicum (Sesame); Soja max (Soybean - note 
Glycine max is the major species); Triticum species (Wheat); and Zea maize (Corn). 

Three types of plant GPAT have been reported: plastidial (P), mitochondrial (M), 
and cytosolic (ER). They exhibit different specificities towards acyl-ACP and acyl- 
CoA derivatives of fatty acids !1 (Frentzen, 1993). The P-GPAT is mainly concerned 

20 with phospholipid biosynthesis in chloroplasts and has been shown to use both acyl- 
ACPs and acyl-CoAs as substrates, although the latter with a lower efficiency 12 
(Wilkinson and Bell, 1997). The ER form of GPAT is the most important for TAG 
biosynthesis but a plant gene has not yet been cloned. The ER-GPAT uses acyl- 
CoAs. The GPAT from the enteric bacterium Escherischia coli can use both acyl- 

25 ACP and acyl-CoA equally well 13 ( Wilkinson and Bell, 1997). 

As will be appreciated by persons skilled in the art, the invention also relates to 
substantially homologous DNA sequences from plants encoding proteins with 
deduced amino acid sequences of 25% or greater identity, and 40% or greater 
similarity, isolated and/or characterized and/or designed by known methods using the 
30 sequence information of SEQ ID NO: 1 or SEQ ID NO:2 or SEQ ID NO: 3 or SEQ ID 
NO:4 or SEQ ID NO:5, and to parts of reduced length that are still able to function as 
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inhibitors of gene expression by use in an anti-sense, co-suppression (Transwitch; 
14 Jorgensen and Napoli 1994) or other gene silencing technologies. It will be 
appreciated by persons skilled in the art that small changes in the identities of 
nucleotides in a specific gene sequence may result in reduced or enhanced 
5 effectiveness of the genes and that, in some applications (e.g. anti-sense or co- 
suppression), partial sequences often work as effectively as full length versions. The 
ways in which the gene sequence can be varied or shortened are well known to 
persons skilled in the art, as are ways of testing the effectiveness of the altered genes. 
All such variations of the genes are therefore claimed as part of the present invention. 

10 Other preferred degrees of identity to the indicated sequences for both DNA and 

protein sequences are at least 30%, 40%, 50%, 60%, 70%, 80%, 90% and 95%; and 
other preferred degrees of similarity are at least 50%, 60%, 70%, 80%, 90% and 
95%. The inventors have used a computer program known as MegAlign®, 
DNASTAR® of DNASTAR Inc., 1228 South Park Street, Madison, WI 53715, 

15 USA, for assessing homology. This program is based on the Clustal V algorithm 
(Higgins and Sharp, 1998): A package for performing multiple sequence alignment 
on a microcomputer; GENE 73:237-244). For each gap introduced in the alignment, 
the program deducts a penalty from the score. A higher gap penalty suppresses 
gapping; a lower value promotes it. The program also assesses penalties based on 

20 the length of the gap. The more residues the gap spans, the greater the penalty. The 
program deducts these penalties from the overall score of the alignment. 

When considering altered oil contents or compositions, results from averages of 
statistically-significant numbers of plants or seeds according to the invention are best 
compared with results from averages of statistically-significant numbers of 

25 untransformed (wild-type) plants or seeds of the same genotype grown under 

identical conditions at the same time. This allows for the variability of individual 
plants of the same genotype, particularly when such plants are grown under different 
conditions. The actual number of plants or seeds used to form the required average 
may vary, but should be enough to provide a generally constant average whenever 

30 such number is selected. Generally, the number should be at least 10, and is more 
preferably at least 20, 30, 50 or 100. 



WO 00/78974 



6 



PCT/CAOO/00738 



Alternatively, the oil contents or compositions may be compared with plants of the 
same species transformed with an open vector (the same vector as that used for the 
introduction of the DNA of the invention, but with such DNA omitted), grown under 
identical conditions at the same time. Again, an average of results from a number of 
such plants, as well as plants transformed according to the present invention, is 
preferred (the numbers being the same as those indicated above). 

The GPAT of the current invention is useful in manipulating GPAT activity, and 
triacylglycerol bioassembly in plants. For example, by transforming plants with a 
construct containing the GPAT gene in a sense orientation, possibly under the control 
of a tissue-specific promoter, the expression of GPAT and accumulation of seed oil 
can be enhanced or the acyl composition of the seed oil altered. Yet another example 
would be to express the GPAT cDN A under the control of a constitutive promoter 
(e.g. 35S ; 15 Datla et ah, 1993) to increase the TAG content of vegetative tissues 
(leaves, roots, stems). This may have particular advantages for altering the starch/oil 
ratio in root crops. 

Alternatively, GPAT expression can be silenced to some degree by anti-sense or co- 
suppression (Transwitch) phenomena ( 16 De Lange et al., 1995; 17 Mol et al., 1990; 
18 JorgensenandNapoli, 1994; 19 Kirmey, 1995; 20 Vaucheret et al, 1998; 21 Taylor, 
1998). For example, silencing GPAT in a seed specific manner may result in a 
20 reduction in TAG accumulation. This could have applications, for example, in 
reducing the oil content in seed barley to enhance stability during storage. As a 
second example, seed-specific silencing may lead to a relatively high accumulation 
of DAG or an increase in the DAG/TAG ratio in the developing or mature seed. 

Some of the manipulations and deliverables which are possible using the GPAT gene 
25 or a part thereof, include, but are not limited to, the following: seeds with increased 
or decreased oil content; seeds containing oils with an enhanced diacylglycerol 
content, seed oils with an altered acyl composition; plants producing larger or 
heavier seeds; plants exhibiting an enhanced or altered capacity to accumulate 
storage compounds in other storage organs (e.g. tubers, roots). 
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BRIEF DESCRIPTION OF THE BRA WINGS 

Preferred embodiments of the invention are illustrated with the help of the drawing: 

Figure 1 shows the Seed oil content of wild type (Wt) Arabidopsis thaliana, and A. 
thaliana transformed with: vector only (pHS737); the unmodified GPAT encoding 
5 DNA from safflower (ctpGPA); the GPAT encoding DNA from safflower from 

which the transit peptide has been deleted (ctpGPA-TP); the GPAT encoding DNA 
from safflower from which the transit peptide has been deleted and the ER retention 
signal has been added (ctpGPA+ERRS); the GPAT encoding DNA from Escherichia 
coli (plsB); and the GPAT encoding DNA from Escherichia coli to which the ER 
10 retention signal has been added (plsB+ERRS). 

BEST MODES FOR CARRYING OUT THE INVENTION 

The inventors chose to use the well-accepted model plant system Arabidopsis 
thaliana for the cloning of GPAT, as a host system for genetic engineering to alter 
GPAT expression, and to study the effects of altering GPAT expression on seed 

15 triacylglycerol bioassembly. Over the past several years, Arabidopsis thaliana, a 
typical flowering plant, has gained increasing popularity as a model system for the 
study of plant biology. As a result of the ease with which this plant lends itself to 
work in both classical and molecular genetics, Arabidopsis has come to be widely 
used as a model organism in plant molecular genetics, development, physiology and 

20 biochemistry ( 22 Meyerowitz and Chang, 1985; 23 Meyerowitz, 1987; 24 Goodman et 
al., 1995). This model dicotyledonous plant is also closely related to Brassica crop 
species and it is increasingly apparent that information concerning the genetic control 
of basic biological processes in Arabidopsis will be transferable to other species 
( 25 Lagercrantz et al., 1996). 

25 Indeed, there are numerous examples wherein studies of the molecular biology and 
biochemistry of a particular metabolic pathway or developmental process and the 
possibility of genetically engineering a plant to bring about changes to said metabolic 
pathway or process, has first been tested in the model plant Arabidopsis, and then 
shown to yield similar phenotypes in other plants, particularly crop plants. 
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For example, the extra- plastidial membrane associated oleate (18:1) A12 (co-6) 
desaturase gene, FAD2, was originally studied and eventually cloned from 
Arabidopsis thaliana, by identifying the lesion found in an A. thaliana mutant 
defective in desaturating oleate to produce linoleate (18:2) on the 
5 phosphatidylcholine backbone. This resulted in a high oleic acid phenotype in the A. 
thaliana seed oil ( 26 Okuley et al., 1994). Genetic engineering of both soybean 
(Glycine max,) and canola B. napus to silence the indigenous FAD2 gene(s) in a 
seed-specific manner by anti-sense or co-suppression approaches, resulted in similar 
high oleic acid seed oil phenotypes ( 27 Kinney, 1995; 1997). 

10 Transgenic expression of a yeast sn-2 acyltransferase (SLC1-1) gene to achieve 
modified seed oil content and enhance very long-chain fatty acid content was first 
performed in Arabidopsis and later shown to yield similar phenotypes in transgenic 
rapeseed (B. napus) experiments ( Zou et al., 1997). Arabidopsis thaliana has 
repeatedly shown itself to be a useful model system for metabolic engineering of 

15 metabolic pathways (e.g. lipid biosynthesis, photosynthesis) or processes 

(organogenesis, reproductive development etc.) common to all higher plants. 

In the area of secondary metabolism/signal transduction, an anthocyanin pathway- 
specific transcriptional activator from the monocot maize designated as R (the myc 
transcription factor involved in activation of biosynthetic genes for anthocyanin 

20 production in the aleurone cells of maize kernels), was expressed in the dicot 

Arabidopsis, causing augmented anthocyanin pigmentation in the infloresecences. 
Subsequent expression in another dicot, tobacco (Nicotiana tabacum), resulted in 
similar floral pigmentation changes ( 29 Lloyd et al., 1992). These experiments 
demonstrate that whole pathways common to all flowering plants can be co- 

25 ordinately controlled through the introduction of transcriptional regulators, and that 
the mechanisms are common to diverse plant species. 

In the context of the current invention, all plant seeds accumulate some 
triacylglycerol (oil) and this ubiquitous process is affected, at least in part, by the 
activity of GPAT, as explained previously. Thus, many of the effects observed 
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following genetic engineering to modulate GPAT expression in Arabidopsis can be 
expected to result in similar phenotypes when carried out in all other plants. 

There are a number of ways by which genes and gene constructs can be introduced 
into plants, and a combination of plant transformation and tissue culture techniques 
5 have been successfully integrated into effective strategies for creating transgenic 
crop plants. These methods, which can be used in the present invention, have been 
extensively reviewed elsewhere ( 30 Potrykus, 1991; 31 Vasil, 1994; 32 Walden and 
Wingender, 1995; 33 Songstad et al., 1995), and are well known to persons skilled in 
the art. For example, one skilled in the art will certainly be aware that, in addition to 

10 Agrobacterium-mediated transformation of Arabidopsis by vacuum infiltration 

( 34 Bechtold et al., 1993) or wound inoculation ( 35 Katavic et aL, 1994), it is equally 
possible to transform other plant and crop species, using Agrobacterium Ti-plasmid- 
mediated transformation (e.g. hypocotyl; 36 DeBlock et al., 1989) or cotyledonary 
petiole ( 37 Moloney et al, 1989) wound infection), particle bombardment/biolistic 

15 methods ( 38 Sanford et al., 1987; 39 Nehra et al., 1994; 40 Becker et al., 1994) or 
polyethylene glycol-assisted protoplast transformation ( 41 Rhodes et al., 1988; 
42 Shimamoto et al., 1989) methods. 

As will also be apparent to persons skilled in the art, and as extensively reviewed 
elsewhere ( 43 Meyer, 1995; ^Datla et al., 1997), it is possible to utilize plant 

20 promoters to direct any intended up- or down-regulation of transgene expression 

using constitutive promoters (e.g. those based on CaMV35S), or by using promoters 
which can target gene expression to particular cells, tissues (e.g. napin promoter for 
expression of transgenes in developing seed cotyledons), organs (e.g. roots), to a 
particular developmental stage, or in response to a particular external stimulus (e.g. 

25 heat shock). 

Particularly preferred plants for modification according to the present invention 
include Arabidopsis thaliana, borage (Borago spp.), Canola, castor (Ricinus 
communis), cocoa bean (Theobroma cacao), corn (Zea mays), cotton (Gossypium 
spp), Crambe spp., Cuphea spp., flax (Linum spp.), Lesquerella and Limnanthes spp., 
30 Linola, nasturtium (Tropaeolum spp.), Oenothera spp., olive (Olea spp.), palm 

(Elaeis spp.), peanut (Arachis spp.), rapeseed, safflower (Carthamus spp.), soybean 
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(Glycine and Soja spp.), sunflower (Helianthus spp.), tobacco (Nicotiana spp.), 
Vernonia spp., wheat (Triticum spp.), barley (Hordeum spp.), rice (Oryza spp.), oat 
(Avena spp.) sorghum (Sorghum spp.), rye (Secale spp.) or other members of the 
Gramineae. 

5 The present invention is particularly useful when used to modify the yield or 

composition of oilseed produced from oilseed crops. Oilseed crops are plant species 
that are capable of generating edible or industrially useful oils in commercially 
significant yields, and include many of the plant species listed above. Such oilseed 
crops are well known to persons skilled in the art. 

10 Once a transgenic oilseed plant has been produced according to the present 

invention, it can be grown and harvested in conventional ways. Oil may be extracted 
from harvested seed by collecting and crushing the seed, and/or by methods of 
solvent extraction, in which the crushed seeds are contacted with a solvent for the oil 
and the resulting solution is filtered off or decanted and the solvent removed. Other 

15 conventional and traditional methods of oil extraction may be used, if desired. 

The method of the invention encompasses the transformation of any organism with a 
DNA encoding a protein having GPAT activity. 

As an example of the method of the invention, the inventors have demonstrated the 
role of GPAT in regulating the amount of TAG by expressing, in yeast and in the 
20 plant Arabidopsis thaliana, a P-GPAT gene (ctpgpaf) from safflower 45 (Bhella and 
MacKenzie, 1994) and the GPAT gene (plsB) from E, coli. 

Normally, plastidial proteins encoded by nuclear genes are targeted to plastids by a 
transit peptide (tp). Removal of the tp will confine such proteins to the cytosol. 
Since the enzymes of TAG biosynthesis are present in the endoplasmic reticulum 
25 (ER) and TAGs are synthesised at the ER, an ER retention signal 46 (errs; Jackson et 
al., 1990) which has been shown to target many heterologous proteins including 
E.coli LPAAT to the ER (Weier et aL 47 , 1998) was used to target P-GPAT without a 
tp. The plsB gene was used as such and also with an added errs sequence. 
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It is generally accepted in the art that proteins having 60% or greater sequence 
homology will have identical functionalities. Nonetheless, many cases are known in 
which far lower sequence homologies (e.g. 25 to 30%) exist and yet the proteins 
have identical functionalities. 

5 The particular genes mentioned above will have numerous homologous variants, by 
virtue of the degeneracy of the genetic code. The specific examples described to 
illustrate the invention obviously relate as well to such homologous variants. 

The use of heterologous genes (those GPATs taken from other species) may be 
particularly advantageous, because the encoded proteins may not be subject to 
10 regulation (such as feedback inhibition, or inhibition by native inhibitors) in the same 
way as the native GPAT. 

The inventors used the vector pYES2 (Invitrogen) for transformation of yeast, and 
the vector pHS737, for transformation of Arabidopsis thaliana. Examples of other 
vectors are: 

15 Yeast vectors: 

pYeDP60 (Urban P, Cullin C, Pompon D 1991. Maximizing the expression of 
mammalian cytochrome P-450 monooxygenase activities in yeast cells. Biochimie 
72: 463-472); pCGS109 (Botstein D, David RW, Fink GR, Taunton-Rigby A, 
Knowlton RG, Mao J-I, Moir DT, Goff CG 1987. GAL 1 yeast promoter linked to 
20 non galactokinase gene. US patent No. 4661454); pYEUra3 (Clontech). 

Plant vectors: 

pHS737 and pHS738 (Selvaraj and Hirji; unpublished); pRD400 (Datla RS, 
Hammerlindl JK, Panchuk B, Pelcher LE, Keller W. 1992. Modified binary plant 
transformation vectors with the wild-type gene encoding NPTIL Gene 122:383- 
25 384.); pBinl9 (Frisch DA, Harris-Haller LW, Yokubaitis NT, Thomas TL, Hardin 
SH, Hall TC. 1995. Complete sequence of the binary vector Binl9. Plant Mol Biol 
27:405-409.); pCGN3223 (Roesler K, Shintani D, Savage L, Boddupalli S, Ohlrogge 
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JB (1997) Targeting of the Arabidopsis homomeric acetyl-coenzyme A carboxylase 
to plastids of rapeseeds. Plant Physiol 113: 75-81). 

MA TERIALS AND METHODS 

Preparation of Chimeric Genes and Expression Vectors. 

5 An open reading frame (orf 9 -1.1 kb) without tp was amplified by PCR from the 

ctpgpat cDNA. Another chimeric gene containing an errs at its 3* end was also PCR 
amplified. The orf of the E.coliplsB gene (-2.5 kb) was PCR amplified from 
bacterial DNA without modification or with an errs at its 3* end. The blunt-end PCR 
fragments generated using Pfu DNA polymerase were cloned into pSK II 

10 (Strategene) cloning vector and were sequenced to confirm the nucleotide sequence 
as well as incorporation of restriction sites and errs sequences into the chimeric 
genes. The modified genes were labelled as ctpgpat-tp, ctpgpat- tp+ errs \ plsB and 
plsB+errs. The intact ctpgpat cDNA was also used (Bhella and MacKenzie 48 , 1994). 
The sequences of the modified and unmodified genes are shown in SEQ. ID. NOS. 1 

15 to 5. 



Transformation of a Yeast 

For the yeast expression study, the ctpgpat or plsB chimeric genes were retrieved as 
BamHl or Bglll segments, respectively, and were cloned into the BamHI site of the 
yeast expression vector, pYES2 (Invitrogen), under the transcriptional control of a 
20 galactose inducible promoter (GAL1). 

The INVScl strain (Invitrogen) of yeast was transformed with the above 
recombinant constructs by the heat shock method (Elble 49 , 1992) to assess the 
functionality of the genes and the derived proteins. 



Enzyme Assay 



25 



Yeast cells containing chimeric GPAT genes were grown in SC-Ura (Bio 101) 
containing glucose. A 2.5 mL culture was initiated and grown for 18 hr at 28°C. A 
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fresh 10 mL culture was grown by adding equal number of yeast cells to this culture 
and grown for another 24 hr, GPAT gene expression was induced by transferring 
cells to growth medium containing galactose as follows : the cells were then 
transferred to 10 mL SC-Ura and galactose and the GPAT gene expression was 
5 induced for 24 hr. The yeast cells were in stationary phase by then. Cells were 
either used for protein extraction for enzyme assay or for lipid analysis. 

Total proteins from the control and GPAT overexpressing cells were extracted by the 
glass bead or French Press method, and GPAT activity was assayed (Eccleston and 
Harwood 50 , 1995). The products of the reaction were separated into LP A, PA, and 
10 DAG and TAG by thin layer chromatography (Zou et al. 51 , 1999) and identified 

using standards. Total lipids in control and cells in which GPAT was expressed were 
measured by *H NMR (Rutar 52 , 1989) and by gas liquid chromatography. 

GPAT Activity in Yeast 

All of the GPAT genes used produced functionally active protein when expressed by 
15 a galactose inducible promoter in yeast cells. Extracted GPATs were assayed for 
activity in vitro, by looking at the production of lysophosphatidic acid, and the 
results are listed in Table 1. 



TABLE 1: FORMATION OF LYSOPHOSPHATIDIC ACID BY ISOLATED 
CPA TS EXPRESSED IN YEAST IN VITRO 



Construct 


substrate 18:1 -Co A 
(pmol/min/mg) 


Substrate 16:0-CoA 
(pmol/min/mg) 


glucose 


Galactose 


Glucose 


galactose 


PYES2 (vector) 


10.0 


6.8 


192 


131 


pYES2:ctpgpat-tp 


9.6 


33.0 


229 


175 


pYBS2:ctpgpat-tp 
+errs 


13.7 


44.4 


138 


157 
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PYES2: plsB 


10.2 


28.5 


196 


790 


pYES2: plsB+errs 


12.4 


20.1 


211 


346 



Increased GPAT activity lead to more total lipid production in the in vitro assay, as 
shown in Table 2. 

Increased GPAT activity leads to enhanced lipid production in vitro and an increase 
5 in the proportions of all Kennedy pathway intermediates. The plsBs were able to use 
both 16:0-CoA and 18:1 -Co A, the former with several fold higher efficiency. The 
ctpgpats could use only 18:1 -Co A but with greater efficiency than the plsBs. 

The transformants produce more lipids in vivo, relative to the control, as shown in 
Tables 3 & 4. 

10 TABLE 2: FORMA TION OF TOTAL LIPIDS BY ISOLA TED CPA TS IN 

VITRO 



Construct 


substrate 18:l-CoA 


Substrate 16:0-CoA 






(pmol/min/mg) 




(pmol/min/mg) 






glucose 


galactose 


ratio* 


glucose 


galactose 


ratio* 


PYES2 
(vector) 


116 


112 


0.96 


1140 


513 


0.44 


pYES2:ctpg 
pat-tp 


184 


395 


2.15 


1170 


868 


0.74 


pYES2:ctpg 
pat-tp 


175 


341 


1.95 


717 


771 


1.07 


+errs 














PYES2: 
plsB 


98 


247 


2.51 


789 


2990 


3.77 
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Construct 


substrate 18:l-CoA 
(pmol/min/mg) 


Substrate 16:0-CoA 
(pmol/min/mg) 


glucose 


galactose 


ratio* 


glucose 


galactose 


ratio* 


PYES2: 
pls3+errs 


152 


182 


1.19 


1250 


1830 


1.46 



galactose/glucose 



TABLE 3: ESTIMATION OF OIL CONTENT IN CONTROL AND 
TRANSFORMED YEASTS BY 1 H NMR 



Construct 


'H NMR response 


Ratio of oil contents* 


% increase 


INVSc (wild type) 


0.81 






PYES2 (vector) 


1.12 


1.00 




PYES2: ctpgpat-tp 


1.3 


1.17 


17 


PYES2: 
ctpgpat-tp+errs 


1.68 


1.51 


51 


PYES2-./7&B 


1.52 


1.37 


37 


PYES2: plsB+errs 


1.22 


1.10 


10 



* cells grown in galactose/cells grown in glucose 



5 TABLE 4: DETERMINA TION OF OIL CONTENT OF WILD TYPE AND 

TRANSFORMED YEASTS BY GAS CHROMATOGRAPHY 



construct 


glucose (wt%) 


Galactose (wt%) 


INVSc (wild type) 


4.4 


4.4 


PYES2 (vector 


5.0 


6.2 


PYES2: ctpgpat-tp 


5.5 


8.8 


PYES2: ctpgpat-tp+errs 


6.0 


7.4 


plsB 


7.5 


7.6 


plsll+errs 


5.0 


6.2 



Legend to Tables 1 to 4: "glucose" and "galactose" indicate cells grown on glucose 
medium, and those grown on medium containing galactose (to induce the promoter), 
respectively. 
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N.B. comparison in Table 4 can only be made within a construct, as the cells in 
different constructs may be at a different growth stage. 

According to the process of the invention, lipid content is increased by enhancing 
GPAT activity. No manipulation of growth medium, growth conditions or substrates 
5 is required to achieve a higher lipid content. 

Overexpression of GPATS having selectivity for specific fatty acids could be used to 
enhance the content of speciality oils in micro-organisms. 

GPAT Activity in A. thaliana 

For plant expression study, chimeric genes were cloned into the BamHl site of the 
10 plant transformation vector, pHS737, under the control of a tandem 35S CaMV 
promoter with AMV translational enhancer and 35S poly A for constitutive 
expression. The recombinants were transferred into Agrobacterium tumefaciens GV 
3101 for Arabidopsis thaliana transformation. 

Arabidopsis plants were transformed by the floral dip method (Clough and Bent 53 , 
15 1998). Seeds (Tj) from these plants were collected and selected on a growth medium 
containing kanamycin. Transgenic plants were grown to maturity and seeds (T 2 ) 
from 10 individual plants were collected and used for lipid analysis. Wild type and 
plants transformed with vector alone were grown as controls along with the 
transformed plants. 

20 Lipid Analysis 

The fatty acid composition of seeds was determined by GC analysis following 
extraction of the oil and conversion of the triglycerides to fatty acid methyl esters. A 
known amount of CI 5 triglyceride was added to the seed sample as a tracer before 
oil extraction. Total seed lipid content was estimated on the basis of the recovery of 
25 CI 5 fatty acid methyl ester. CI 7:0 methyl ester was used as an internal standard for 
the chromatography. Fatty acid methyl esters were analysed using an HP 5850 gas 
chromatograph equipped with a DB-23 column (30m X 0.25mm; J & W Scientific, 
Folsom, CA). The GC conditions were: injector temperature and flame ionisation 
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detector temperature, 250°C After an initial hold at 180°C for 1 min, the oven 
temperature was programmed to 240°C at 4°C/min and held at this temperature for 
10 min. 

Most of the transgenic plants appeared normal in morphology. The oil content and 
5 seed sizes of selected lines representing each construct are shown in Table 5. 



TABLE 5: OIL CONTENT OF WILD TYPE AND TRANSFORMANT 
ARABIDOPSIS THALIANA SEEDS 



v^onsxruci 


tran s i ormani 
reference no. 


Oil content 


wt/100 seeds 
(mg) 


Wt% 


% increase 


wild type 


average i ) 


OK A i A O 

zo.v ± U.o 




\ A O _l_ A 

1.42 ± 0.29 












^vector ) 


average ^n — / ) 


OK O _i_ l An 

26.© ± i .oy 




1 /I 1 -L. /*! OK 

1.41 ± U.2o 














315-2 


2.4 


20.9 


2.47 




315-3 


29.4 


9.7 


1.92 




315-4 


30.1 


12.3 


1.90 




315-7 


29.6 


10.4 


1.85 












ctpgpat-tp 


301-2 


34.0 


26.9 


2.03 




301-3 


34.6 


29.1 


1.90 




301-5 


29.3 


9.3 


1.79 












ctpgpat-tp 
+errs 


302-2 


29.4 


9.7 


1.44 




302-6 


32.7 


22.0 


2.45 












plsB 


303-2 


29.8 


11.2 


1.28 




303-3 


30.8 


14.9 


1.98 




303-4 


29.1 


8.6 


1.89 
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Construct 


transformant 
reference no. 


Oil content 


wt/1 00 ^ppH^ 

W \J L \J\J oLLUO 

(mg) 


Wt% 


% increase 




303-7 


33.2 


23.9 


1.53 












plsB+errs 


304-1 


30.9 


15.2 


1.51 




304-2 


32.5 


21.3 


1.60 




304-3 


30.3 


13.1 


1.38 




304-15 


32.5 


21.3 


2.20 



Seeds of plants transformed using only the pHS737 vector were indistinguishable in 
oil content from wild type control plants grown under the same conditions. All other 
gene constructs produced higher seed oil content. The unmodified ctpgpat, which 
5 would be expected to be expressed in the plastid, produced oil increases ranging 
from 10 to 21%. This suggests that LPA is released from the plastids and 
subsequently converted to TAGs. On average the greatest increase in oil was 
observed in seeds of transformants carrying the ctpgpat-tp gene (average +22%). 

The plsB gene increased seed oil content by an average of 15%. The addition of an 
10 ER targeting sequence resulted in an average seed oil increase of 1 8%. 

Seeds of plants transformed with the vector only did not differ significantly in 
average weight from wild type plants. Seeds of individual plants from each construct 
were significantly heavier than wild type and the pHS737 control; e.g. 315-2, 301-2, 
302-6, 303-3 and 304-15. However, increased seed oil content was not always 
15 positively correlated with increased seed weight; e.g. 303-7 and 304-1 . 

Phenotypes presenting increased seed oil content and weight would result in 
increased yield from oilseed crops. Those presenting an increase in seed oil content 
without an increase in weight would provide more oil per tonne of seed, representing 
an additional advantage to oil seed producers. 

20 Compositional changes were also observed in the seed oils of the transformants. 
Selected examples are illustrated in Table 6. 
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TABLE 6: FATTY ACID COMPOSITION OF WILD TYPE AND 
TRANSFORMED ARABIDOPSIS THALIANA 



Construct/sample 
reference no. 


wt% oil 


18:1 


18:3 










wild type 


26.9 ± 0.8 (n= 7) 


14.4 ± 0.7 


15.7 ±0.8 










PHS73 7 (vector) 


26.8 ± 1.1 (n=6) 


12.4 ± 1.3 


16.9 ±0.4 










Ctpgpat 








315-2 


32.4 


13.6 


18.4 


315-4 


30.1 


14.7 


16.5 


315-6 


25.2 


9.1 


18.4 


315-8 


27.1 


9.2 


19.9 


315-10 


28.3 


12.4 


17.5 










ctpgpat-tp 








301-1 


27.5 


12.0 


17.1 


301-2 


34.0 


10.9 


18.8 


301-3 


34.6 


12.2 


18.2 










ctpgpat-tp+errs 








302-4 


28.1 


12.5 


18.5 


302-5 


25.6 


12.2 


17.5 


302-6 


32.7 


12.9 


17.5 


302-7 


27.8 


15.8 


16.5 










plsB 








303-1 


28.4 


12.0 


17.8 


303-3 


30.8 


11.4 


17.6 


303-7 


29.1 


14.7 


17.1 




33.2 


14.8 


15.9 



WO 00/78974 



20 



PCT/CAOO/00738 



T"f*flf*rf*n f*f* tin 


\x/t°/i nil 
W L /0 UI1 


1 8-1 
10.1 


1 G-1 

18.3 










304-2 


32.5 


12.4 


17.0 


304-3 


30.3 


12.1 


16.8 


304-4 


23.5 


10.7 


18.0 


304-9 


27.4 


13.9 


17.4 


304-15 


32.5 


14.1 


16.9 











Seed oils from plants transformed with the vector alone (pHS737) were not 
significantly different from the wild type. 

The proportion of 18:1 Z9 (oleic acid) decreased and the proportion of 18:3 Z9, Z12, 
5 Z15 (a-linolenic acid) increased in some of the individual transformants of all 

contructs. Examples of this effect are 315-6, 315-8, 301-2, 302-5, 303-3 and 304-4. 

INDUSTRIAL APPLICABILITY 

The Kennedy pathway is common to all organisms. Transformation of yeast or 
plants with DNA encoding GPAT activity can be used both to enhance oil content, 
10 and to alter the fatty acid composition of TAGs. The use of GPATs with different 

acyl-CoA or acyl-ACP specificities can be used to tailor the fatty acid composition of 
the TAGs produced by the micro-organism or plant. 

The method of the invention can manipulate oil synthesis in other organisms such as 
yeast, other fungae and algae for producing commodity and speciality oils. 
15 Increasing the oil content of feed quality grains would reduce the need for adding 
exogenous fats in the diets of animals and birds 54 (Kishore and Shewmaker, 1999). 

Saffiower plastidial GPAT, the GPAT used in the examples, prefers unsaturated 
acyl-CoA or acyl-ACP, whereas E. coli GPAT, also used in the examples, prefers 
saturated acyl-CoA or acyl-ACP. These genes can be used to modify the type of 
20 fatty acid at the sn-\ position of TAGs. This enables the production of structured 
TAGs, in which the fatty acids occupying each position may be controlled. It is 
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believed that fatty acid absorption and physiological effect are related to TAG 
structure, and not just gross composition. This also has implications for 
manipulating fat content in humans and other animals. 

DEPOSITS OF BIOLOGICAL MATERIAL 

5 A plasmid library containing a mixture of plasmids (pYES2:c tpgpat-tp, 

pYTLS2\ctpgpat-tp+errs*, pYES2:/?/sB and pYES2:plsB+errs) was deposited, 
according to the Budapest Treaty, on May 24, 2000, at the International Depository 
Authority of Canada (Winnipeg, Manitoba, Canada), under accession number ID AC 
240500-2 and reference pYEASTOIL. 

10 A plasmid library containing a mixture of plasmids (pRS737:ctpgpat, 

pHS737 :ctpgpat-tp, pHS737:ctpgpat-tp+errs, pHS737:/?MB and pHS737:plsB +errs) 
was deposited according to the Budapest Treaty, on May 24, 2000, at the International 
Depository Authority of Canada (Winnipeg, Manitoba, Canada), under accession 
number ID AC 240500-1 and reference pPLANTOIL. 

15 SEQUENCES 

SEQ ID NO: 1 is the DNA ctpgpat (encoding intact safflower plastidial GPAT) 

SEQ ID NO: 2 is the DNA ctpgpat-tp (encoding safflower plastidial GPAT minus 
transit peptide) 

SEQ ID NO: 3 is the DNA ctpgpat-tp^ errs (encoding safflower plastidial GPAT 
20 minus transit peptide plus ER retention sequence) 

SEQ ID NO: 4 is the DNA plsB (encoding E. coli GPAT) 

SEQ ID NO: 5 is plsB+errs (encoding E. coli GPAT plus ER retention sequence) 

SEQ ID NO: 6 is the protein encoded by ctpgpat (SEQ ID NO: 1; intact safflower 
plastidial GPAT) 

25 SEQ ID NO: 7 is the protein encoded by ctpgpat-tp (SEQ ID NO: 2; safflower 
plastidial GPAT minus transit peptide) 
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SEQ ID NO: 8 is the protein encoded by ctpgpat-tp+errs (SEQ ID NO: 3; safflower 
plastidial GPAT minus transit peptide plus ER retention sequence) 

SEQ ID NO: 9 is the protein encoded by plsB (SEQ ID NO: 4; E. coli GPAT) 

SEQ ID NO: 10 is the protein encoded by plsB+errs (SEQ ID NO: 5; E. coli GPAT 
5 plus ER retention sequence) 
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International Depositary Authority of Canada 

Room H5 190,101 5 Arlington Street, Tel: (204) 789-2002 

Winnipeg, Manitoba, Canada R3E 3R2 Fax: (204) 789-2036 

International Form IDAC/BP/4 

5 

RECEIPT IN THE CASE OF AN ORIGINAL DEPOSIT 

(Issued pursuant to Rule 7. 1 of the Budapest Treaty Regulations) 

ATTACH COPIES OF THE ORIGINAL DEPOSIT CONTRACT AND VIABILITY STATEMENT 

10 

This International Depository Authority accepts the deposit of the microorganism specified 
below \ which was received by it on May 24. 2000 

To (Name of Depositor) Dr. S. L. MacKenzie ___ 

15 Address National Research Council of Canada. Plant Biotechnology Institute 

110 Gymnasium Place. Saskatoon. Saskatchewan. S7N 0W9 

IDENTIFICATION OF DEPOSIT 

Reference assigned by depositor pPLANTOIL 

20 Accession Number assigned by this ID A IDAC 240500-1 ; 

The deposit identified above was accompanied by: 

n a scientific description (specify) 



25 I I a proposed taxonomic designation (specify). 



Signature of person(s)authorized to represent IDAC: 
30 



Mav 24. 2000 



Date 



Receipt in the Case of an Original Deposit 



1/1 
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International Depositary Authority of Canada 

Room H5 190, 101 5 Arlington Street, Tel: (204) 789-2002 

Winnipeg, Manitoba, Canada R3E 3R2 Fax: (204) 789-2036 

International Form IDAC/BP/9 

5 

STATEMENT OF VIABILITY 

(Issued pursuant to Rule 10.2 of the Budapest Treaty Regulations) 

PARTY TO WHOM THE VIABILITY STATEMENT IS ISSUED 

10 

Name Dr. S. L. MacKenzie 

Address National Research Council of Canada. Plant Biotechnology Institute 
110 Gymnasium Place. Saskatoon. Saskatchewan. S7N 0W9 

15 DEPOSITOR 

Name Dr. S. L. MacKenzie 

Address National Research Council of Canada. Plant Biotechnology Institute 
1 10 Gymnasium Place. Saskatoon. Saskatchewan. S7N 0W9 

20 

IDENTIFICATION OF THE DEPOSIT 

Accession Number given by the International Depository Authority IDAC 240500-1 

25 Date of the original deposit (or most recent relevant date) May 24.2000 

VIABILITY TEST 

The viability of the deposit identified above was tested on (most recent test date) May 26. 
30 2000 

On the date indicated above, the culture was: 

[X] viable 

I I no longer viable 

35 

Conditions under which the Viability Test were performed (to be filled in if the information 
has been requested and the results of the test were negative) 

Signature of person(s) authorized to represent IDAC 

40 



Date Mav 29. 2000 



45 Statement of Viability 1/1 



WO 00/78974 



25 



PCT/CAOO/00738 



5 



International Depositary Authority of Canada 

Room H5 190,1015 Arlington Street, Tel: (204) 789-2002 
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15 Address National Research Council of Canada. Plant Biotechnology Institute 
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STATEMENT OF VIABILITY 

(Issued pursuant to Rule 10.2 of the Budapest Treaty Regulations) 
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10 

Name Dr. S. L. MacKenzie 

Address National Research Council of Canada. Plant Biotechnology Institute 
110 Gymnasium Place. Saskatoon. Saskatchewan. S7N 0W9 

15 DEPOSITOR 

Name Dr. S. L. MacKenzie 

Address National Research Council of Canada. Plant Biotechnology Institute 
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T IS CLAIMED IS: 

A method for increasing the triacylglyceride content of an organism 
characterised by expressing in the organism an introduced DNA encoding a 
protein having glycerol 3-phosphate acyltransferase (GPAT) activity. 

A method according to claim 1, characterised in that the organism is a plant. 

A method according to claim 2, characterised in that the plant is an oilseed 
bearing plant. 

A method according to claim 2, characterised in that the plant is of the genus 
Brassica. 

A method according to claim 2, characterised in that the plant is Arabidopsis 
thaliana, 

A method according to claim 1 , characterised in that the organism is a yeast. 

A method according to any one of claims 1 to 6, characterised in that the DNA 
encodes a protein comprising a sequence that differs from SEQ ID NO:6 but has 
at least 70% sequence homology with SEQ ID NO: 6 and the same function as 
the protein of SEQ ID NO:6. 

A method according to any one of claims 1 to 6, characterised in that the DNA 
comprises a sequence encoding a protein comprising SEQ ID NO: 6. 

A method according to any one of claims J to 6, characterised in that the DNA 
encodes a protein comprising a sequence that differs from SEQ ID NO:9 but has 
at least 70% sequence homology with SEQ ID NO: 9 and the same function as 
the protein of SEQ ID NO:9. 

A method according to any one of claims 1 to 6, characterised in that the DNA 
comprises a sequence encoding a protein comprising SEQ ID NO: 9. 

A method according to any one of claims 1 to 6, characterised in that the DNA 
is a DNA having a sequence as recited in SEQ ID NO: 1 , or a variant thereof 
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having at least 70% sequence identity to SEQ ID NO:l . 

1 2. A method according to any one of claims 1 to 6, characterised in that the DNA 
is a DNA having a sequence as recited in SEQ ID NO: 2, or a variant thereof 
having at least 70% sequence identity to SEQ ID NO:2. 

13. A method according to any one of claims 1 to 6, characterised in that the DNA 
is a DNA having a sequence as recited in SEQ ID NO: 3, or a variant thereof 
having at least 70% sequence identity to SEQ ID NO:3. 

14. A method according to any one of claims 1 to 6, characterised in that the DNA 
is a DNA having a a sequence as recited in SEQ ID NO: 4, or a variant thereof 
having at least 70% sequence identity to SEQ ID NO:4. 

15. A method according to any one of claims 1 to 6, characterised in that the DNA 
is a DNA having a sequence as recited in SEQ ID NO: 5, or a variant thereof 
having at least 70% sequence identity to SEQ ID NO:5. 

1 6. A method for increasing the triacylglyceride content of an organism by 
transforming the organism with a vector, characterised in that the vector 
comprises a DNA encoding a protein comprising SEQ ID NO: 6, or a protein 
having the same function comprising a sequence having at least 70% homology 
with SEQ ID NO: 6. 

17. A method for increasing the triacylglyceride content of an organism by 
transforming the organism with a vector, characterised in that the vector 
comprises a DNA encoding a protein comprising SEQ ID NO: 7, or a protein 
having the same function comprising a sequence having at least 70% homology 
with SEQ ID NO: 7. 

1 8. A method for increasing the triacylglyceride content of an organism by 
transforming the organism with a vector, characterised in that the vector . 
comprises a DNA encoding a protein comprising SEQ ID NO: 8, or a protein 
having the same function comprising a sequence having at least 70% homology 
with SEQ ID NO: 8. 
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1 9. A method for increasing the triacylglyceride content of an organism by 
transforming the organism with a vector, characterised in that the vector 
comprises a DNA encoding a protein comprising SEQ ID NO: 9, or a protein 
having the same function comprising a sequence having at least 70% homology 
with SEQ ID NO: 9. 

20. A method for increasing the triacylglyceride content of an organism by 
transforming the organism with a vector, characterised in that the vector 
comprises a DNA encoding a protein comprising SEQ ID NO: 10, or a protein 
having the same function comprising a sequence having at least 70% homology 
with SEQ ID NO: 10. 

2L A non-human organism transformed with a DNA, characterised in that the 
DNA encodes a protein having GPAT activity, and the organism, after 
transforming, has enhanced ability to produce triacylglycerides (TAGs). 

22. An organism according to claim 21 , characterised in that the organism is a 
plant. 

23. A non-human organism according to claim 21, characterised in that the 
organism is an oil seed bearing plant. 

24. A non-human organism according to claim 22, characterised in that the plant is 
a member of the genus Brassica. 

25. A non-human organism according to claim 21, that is Arabidopsis thaliana. 

26. A non-human organism according to claim 2 1 , characterised in that the 
organism is a yeast. 

27. A non-human organism according to any one of claims 21 to 26, characterised 
in that the DNA encodes a protein comprising SEQ ID NO: 6, or a protein 
having the same function comprising a sequence having at least 70% sequence 
homology with SEQ ID NO: 6. 

28. A non-human organism according to any one of claims 21 to 26, characterised 
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in that the DNA encodes a protein comprising SEQ ID NO: 7, or a protein 
having the same function comprising a sequence having at least 70% sequence 
homology with SEQ ID NO: 7. 

29. A non-human organism according to any one of claims 21 to 26, characterised 
in that the DNA encodes a protein comprising SEQ ID NO: 8, or a protein 
having the same function comprising a sequence having at least 70% sequence 
homology with SEQ ID NO: 8. 

30. A non-human organism according to any one of claims 21 to 26, characterised 
in that the DNA encodes a protein comprising SEQ ID NO: 9, or a protein 
having the same function comprising a sequence having at least 70% sequence 
homology with SEQ ID NO: 9. 

31. A non-human organism according to any one of claims 21 to 26, characterised 
in that the DNA encodes a protein comprising SEQ ID NO: 10, or a protein 
having the same function comprising a sequence having at least 70% sequence 
homology with SEQ ID NO: 1 0. 

32. A non-human organism according to any one of claims 21 to 26, characterised 
in that the DNA is a DNA having a sequence as recited in SEQ ID NO: 1, or a 
variant thereof having at least 70% sequence identity to SEQ ID NO:l. 

33. A non-human organism according to any one of claims 21 to 26, characterised 
in that the DNA is a DNA having a sequence as recited in SEQ ID NO: 2, or a 
variant thereof having at least 70% sequence identity to SEQ ID NO:2. 

34. A non-human organism according to any one of claims 21 to 26, characterised 
in that the DNA is a DNA having a sequence as recited in SEQ ID NO: 3, or a 
variant thereof having at least 70% sequence identity to SEQ ID NO:3. 

35. A non-human organism according to any one of claims 21 to 26, characterised 
in that the DNA is a DNA having a sequence as recited in SEQ ID NO: 4, or a 
variant thereof having at least 70% sequence identity to SEQ ID NO:4. 

36. A non-human organism according to any one of claims 21 to 26, characterised 
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z« f/za/ the DNA is a DNA having a sequence as recited in SEQ ID NO: 5, or a 
variant thereof having at least 70% sequence identity to SEQ ID NO:5. 

37. A vector for genetically transforming an organism, characterised in that the 
vector comprises a DNA encoding a protein having GPAT activity, and the 
organism, after transforming, exhibits enhanced production of triacylglycerides. 

38. A vector according to claim 37, characterised in that the vector comprises DNA 
encoding a protein comprising SEQ ID NO: 6, or a protein having the same 
function comprising a sequence having at least 70% sequence homology with 
SEQ ID NO: 6. 

39. A vector according to claim 37, characterised in that the vector comprises a 
DNA encoding a protein comprising SEQ ID NO: 7, or a protein having the 
same function comprising a sequence having at least 70% homology with SEQ 
ID NO: 7. 

40. A vector according to claim 37, characterised in that the vector comprises a 
DNA encoding a protein comprising SEQ ID NO: 8, or a protein having the 
same function comprising a sequence having at least 70% homology with SEQ 
ID NO: 8. 

41 . A vector according to claim 37, characterised in that the vector comprises a 
DNA encoding a protein comprising SEQ ID NO: 9, or a protein having the 
same function comprising a sequence having at least 70% homology with SEQ 
ID NO: 9. 

42. A vector according to claim 37, characterised in that the vector comprises a 
DNA encoding a protein comprising SEQ ID NO: 10, or a protein having the 
same function comprising a sequence having at least 70% homology with SEQ 
ID NO: 10. 

43. A method for modifying the fatty acid composition of triacylglycerides 
produced by an organism, characterised in that the organism is transformed with 
a DNA encoding a protein having GPAT activity. 



(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 
International Bureau 

(43) International Publication Date 
28 December 2000 (28.12.2000) 




PCT 



(10) International Publication Number 

WO 00/78974 A2 



(51) International Patent Classification 7 : C12N 15/54, 

15/82, 15/81, C12P 7/64, C12N 1/19, A01H 5/00 



Cambridge Crescent, Saskatoon, Saskatchewan S7H 3P9 
(CA). 



(21) International Application Number: PCT/CA00/00738 

(22) International Filing Date: 20 June 2000 (20.06.2000) 

(25) Filing Language: English 

(26) Publication Language: English 



(30) Priority Data: 

60/139,788 
60/182,905 



21 June 1999 (21.06.1999) US 
16 February 2000 (16.02.2000) US 



(63) Related by continuation (CON) or continuation-in-part 
(CIP) to earlier application: 

US 60/139,788 (CIP) 

Filed on 21 June 1999 (21.06.1999) 

(71) Applicant (for all designated States except USJi NA- 
TIONAL RESEARCH COUNCIL OF CANADA 
[CA/CA]; 1200 Montreal Road, Ottawa, Ontario K1A 0R6 
(CA). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): JAIN, Ravinder, 
K. [CA/CA]; 1000-60 Queen Street, Ottawa, Ontario KIP 
5Y7 (CA). MACKENZIE, Samuel, L. [CA/CA]; 17 



(74) Agents: BAUER-MOORE, Andrew et aL; Kirby, Eades, 
Gale, Baker, P.O. Box 3432, Station D, Ottawa, Ontario 
KIP 6N9 (CA). 

(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CR, CU, CZ, 
DE, DK, DM, DZ, EE, ES, FI, GB, GD, GE, GH, GM, HR, 
HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK, LR, 
LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, MX, MZ, 
NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, TJ, TM, 
TR, TT, TZ, UA, UG, US, UZ, VN, YU, ZA, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, 
IT, LU, MC, NL, PT, SE), OAPI patent (BF, BJ, CF, CG, 
CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TG). 

Published: 

— Without international search report and to be republished 
upon receipt of that report 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



< 



0\ = 

0O (54) Title: OVEREXPRESSION IN YEAST AND PLANTS OF A GENE ENCODING GLYCEROL 3-PHOSPHATE ACYL- 
TRANSFERASE 



(57) Abstract: The invention provides a method for increasing the triacylglyceride content of an organism, and/or for modifying the 
fatty acid composition of the triacyllyceride, by expressing in the organism a DNA encoding a protein having glycerol 3-phosphate 
acyltransferase (GPAT) activity. Preferred examples of the method of the invention are illustrated with Arabidopsis thaliana and 
yeast, using constructs prepared from the plastidial GPAT genes from Safflower and E.Coli. 



WO 00/78974 



1/1 



1 

UJ 

+ 
m 



CQ 



o e 



Ul 

♦ ■ 

< 

CD 



iiinimiiuiiiiiiiiiiiniiiimimiiimimimiii 



iiiiiiiiiiiiitiiiiiiiiiiiiiiiiitiiiiiiii 



«8p 



3 



< 

a. 

O 
a. 



I- ^ 




E 

CO 
CO 



(% »M) »ue^uoo no 



, ■ - PTO/SB/01 (10-01) 

Approved for use through 1 0731/2002. OMB 0651 -0032 
U.S. Patent and Trademark Office; U.S. DEPARTMENT OF COMMERCE 
Under the Paperwork Reduction Act of 1995, no persons are required to respond to a collection of information unless it contains a valid OMB control number. 



PECLARATION FOR UTILITY OR 
DESIGN 
PATENT APPLICATION 
(37CFR1.63) 



□ 



Declaration 
Submitted 
with Initial 
Filing 



Declaration 
OR Submitted after Initial 
Filing (surcharge 
(37 CFR1.16 (e)) 

. required) 



Attorney Docket Number 



First Named Inventor 



44838 



JAIN, Ravinder K. et al. 



COMPLETE IF KNOWN 



Application Number 



Fifing Date 



Art Unit 



Examiner Name 



09/926,805 



As the below named inventor, I hereby declare that: 

My residence, mailing address, and citizenship are as stated below next to my name. 

I believe I am the original and first inventor of the subject matter which is claimed and for which a patent is sought on the invention entitled: 



OVEREXPRESSION IN YEAST AND PLANTS OF A GENE ENCODING GLYCEROL 
3-PHOSPHATE ACYLTRANSFERASE 



(Title of the Invention) 



the specification of which 

is attached hereto 
OR 

[3 was filed on (MM/DD/YYYY) 



06/20/2000 



as United States Application Number or PCT International 



Application Number 



PCT/CAOO/00738 



and was amended on (MM/DD/YYYY) 



(if applicable). 



t hereby state that I have reviewed and understand the contents of the above identified specification, including the claims, as amended by 
any amendment specifically referred to above. 

I acknowledge the duty to disclose information which is material to patentability as defined in 37 CFR 1 .56, including for continuation-in-part 
applications, material information which became available between the filing date of the prior application and the national or PCT 
international filing date of the continuation-in-part application. 

Thereby claim foreign priority benefits under 35 U.S.C. 119(a)-(d) or (f), or 365(b) of any foreign application(s) for patent, inventor's or plant 
breeder's rights certificate(s), or 365(a) of any PCT international application which designated at least one country other than the United 
States of America, listed below and have also identified below, by checking the box, any foreign application for patent, inventor's or plant 
breeder's rights certificate(s), or any PCT international application having a filing date before that of the application on which prionty is 
claimed. 



Prior Foreign Application 
Number(s) 


Country 


Foreign Filing Date 
(MM/DD/YYYY) 


Priority 
Not Claimed 


Certified Copy Attached? 
YES NO 








□ 


□ 


□ 








□ 


□ 


□ 








□ 


□ 


□ 








□ 


□ 


□ 



Additional foreign application numbers are listed on a supplemental priority data sheet PTO/SB/02B attached hereto: 



[Page 1 of 2] 

Burden Hour Statement: This form is estimated to take 21 minutes to complete. Time will vary depending upon the needs of the individual case. Any comments on 
the amount of time you are required to complete this form should be sent to the Chief Information Officer, U.S. Patent and Trademark Office, Washington, DC 
20231. DO NOT SEND FEES OR COMPLETED FORMS TO THIS ADDRESS. SEND TO: Assistant Commissioner for Patents, Washington. DC 20231. 



ui q g B O Eh , f , O N 



PTO/SB/01 (10-01) 
Approved for use through 1 0/31/2002. OMB 0651-0032 
-V , U.S. Patent and Trademark Office; U.S. DEPARTMENT OF COMMERCE 

Under the Paperwork Reduction Act of 1995; no persons are required to respond to a collection of information unless it contains a valid OMB control number. 



DECLARATION — Utility or Design Patent Application 



~. . -j * A \77~[ Customer Number 

Direct all correspondence to. Qf Baf Co(je Labe| 



OR | | Correspondence address below 



EDWIN J. GALE 



Name 



Address . 



P.O. BOX 3432, STATION D 



OTTAWA- — 
City 



ONXARI0— - 

State 



K1P6N9 



ZIP" 



CANADA 

Country 



613-237-6900 

Telephone 



613-237-0045 

Fax 



I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information and belief 
are believed to be true; and further that these statements were made with the knowledge that willful false statements and the like so 
made are punishable by fine or imprisonment, or both, under 18 U.S.C. 1001 and that such willful false statements may jeopardize the 
validity of the application or any patent issued thereon. 



NAME OF SOLE OR FIRST INVENTOR : 



A petition has been filed for this unsigned inventor 



RAVINDER K. 



Given Name 
(first and middle [If any]) 



JAIN 



Family Name^ 
or Surname 



Slanature ^7 > / L-~~ — ' 


Date 


piTAWA r U 

Residence: Cltv 


ONTARIO 

State 


CANADA 

Country 


CANADIAN 

Citizenship 


1000-60 QUEEN STREET 

MaWna Address 


OTTAWA 

City 


ONTARIO 

State 


K1P5Y7 

ZIP 


CANADA 

Country 


NAME OF SECOND INVENTOR: 


[^] A petition has been filed for this unsigned inventor 



SAMUEL L. 

Given Name" ~ T " 
(first and middle [If any]) 



MACKENZIE 



Family Narne^ 
or Surname 



Date v 



.SASKATOON ^ 


SASKATCHEWAN 


CANADA 


CANADIAN 


Residence: City ' ^ 


State 


Country 


Citizenship 


17 CAMBRIDGE CRESCENT 








Mailing Address 








SASKATOON 


SASKATCHEWAN 


S7H 3P9 


CANADA 


city . . 


State 


ZIP 


Country 



| | Additional inventors are being named on the supplemental Additional lnventor(s) sheet(s) PTO/SB/02A attached hereto. 



[Page 2 of 2] 



WO 00/78974 



fte'dPCT/FTO 2 1 DEC 2001 

1/21 



SEQUENCE LISTING 
<110> National Research Council of Canada 

<12 0> Overexpression in Yeast and Plants of a Gene Encoding 
Glycerol 3 -Phosphate Acyltransf erase 

<130> 44838-PT 

<140> 
<141> 

<150> US 60/139788 
<151> 1999-06-21 

<150> US 60/182905 
<151> 2000-02-16 

<160> 10 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 1671 
<212> DNA 

<213> Carthamus tinctorius 
<400> 1 

tctctctctc tcacacacaa cacacaaaac acacactact gctactttct ctctctacta 60 
cactctcctc tcgctatgtc gatcttcttc tctccttcct cccctactct cttcttctcc 120 
accacaaacg caaatcctag ggtttctcct tcatcttcac cttcttctgc cttcactcct 180 
cctctgtctt cttctcgcct ccgcccgatt ctccgggggt ttccgtgcct cgcgttctct 24 0 
gcgccggcga atgccgccca tggcacggcg gagaccgt cc acggcaacaa gtggccgtca 3 00 
ccgtcgtcct cctcctctgc tgctacgcaa ccgtccgctg gatccgacca cggtcactct 360 
cgtacattca tcgatgctcg ttccgaacaa gatcttcttt ctggaattca aagagagttg 42 0 
gaagctggaa cactgccaaa acatattgct caagcaatgg aggagctata tcagaactac 480 
aaaaatgcag ttctccaaag tgcggctcct, catgcagaag atattgtgtt gtcaaacatg 54 0 
cgtgtagcgt ttgatcgtat gttcttggat gtgaaggagc cgtttgaatt ttcaccatat 600 
catgaagcta ttttggaacc ttttaactac tatatgtttg gtcaaaatta tattcggcct 660 
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cgtggttttg 


atctcaaacc 


atcaaacaga 


agcagatcca 
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gctgttattg 


ccttgatgct 


tgaaacaaca 


aacccccata 


tttctgagaa 


cataatctac 


840 


gtggcagggg 


atagagtaat 


aacagatcct 


ctttgcaagc 


ctttcagcat 


gggaaggaat 


900 


ctgttgtgcg 


tgtattcaaa 


aaagcatatg 


aatgatgttc 


ctgagcttgc 


tgagatgaaa 


960 


aaaagatcaa 


atacaagaag 


tttaaaagag 


atggctttgc 


ttttgagggg 


cggatctaaa 


1020 


ataatatgga 


ttgcgccaag 


tggtggcagg 


gacaggccag 


atcctatcac 


aaatcagtgg 


1080 


tttccggcac 


cgtttgatgc 


cacttcgctt 


gacaacatga 


gaaggctcgt 


ggaccatgct 


1140 


ggtttggtgg 


gtcacatata 


tcctttagcc 


atattgtgcc 


atgacatcat 


gccccctcct 


1200 


cttcaggttg 


agaaagaaat 


tggagagaag 


aggttgatct 


cttttcatgg 


caccggaata 


1260 


tcagtggcac 


cggaaattaa 


tttccaagaa 


gttactgcct 


cttgtgggtc 


ccccgaggag 


1320 


gcgaaggcag 


cttattcaca 


ggcactctat 


gattccgtgt 


gtgaacaata 


caaggtgcta 


1380 


cattctgcgg 


tacatggagg 


aaaagggtta 


gaagcatcaa 


caccaagtgt 


ctcgttgtca 


1440 


caacccttgc 


agtttctcga 


ttagtctctt 


ggtttagagg 


aggtgaaagc 


atattctttt 


1500 


gtttagatga 


cataggtgta 


tagatgatac 


cgaagaatag 


atgtacaaac 


aagtgataga 


1560 


aagatgtatg 


tctaatcaaa 


aaatgttttc 


tgcatcttgt 


aaagggatct 


tcaaaacaga 


1620 


ccttttattt 


tagctgcagc 


aaccaatata 


tcaaaacagg 


tttttctttt 


t 


1671 



<210> 2 
<211> 1130 
<212> DNA 

<213> Carthamus tinctorius 
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ggatccatgc 


acggtcactc 


tcgtacattc 


atcgatgctc 


gttccgaaca 


agatcttctt 


60 


tctggaattc 


aaagagagtt 


ggaagctgga 


acactgccaa 


aacatattgc 


tcaagcaatg 


12 0 


gaggagctat 


atcagaacta 


caaaaatgca 


gttctccaaa 


gtgcggctcc 


tcatgcagaa 


180 


gatattgtgt 


tgtcaaacat 


gcgtgtagcg 


tttgatcgta 


tgttcttgga 


tgtgaaggag 


240 


ccgtttgaat 


tttcaccata 


tcatgaagct 


attttggaac 


cttttaacta 


ctatatgttt 


300 


ggtcaaaatt 


atattcggcc 


tttggtcaat 


ttcagggaat 


catacgttgg 


caatgtctcc 


360 


gttttcggtg 


taatggaaga 


gcagcttaag 


cagggtgaca 


aggtggtttt 


gatctcaaac 


420 


catcaaacag 


aagcagatcc 


agctgttatt 


gccttgatgc 


ttgaaacaac 


aaacccccat 


480 


atttctgaga 


acataatcta 


cgtggcaggg 


gatagagtaa 


taacagatcc 


tctttgcaag 


54 0 


cctttcagca 


tgggaaggaa 


tctgttgtgc 


gtgtattcaa 


aaaagcatat 


gaatgatgtt 


600 


cctgagcttg 


ctgagatgaa 


aaaaagatca 


aatacaagaa 


gtttaaaaga 


gatggctttg 


660 


cttttgaggg 


gcggatctaa 


aataatatgg 


attgcgccaa 


gtggtggcag 


ggacaggcca 


720 


gatcctatca 


caaatcagtg 


gtttccggca 


ccgtttgatg 


ccacttcgct 


tgacaacatg 


780 


agaaggctcg 


tggaccatgc 


tggtttggtg 


ggtcacatat 


atcctttagc 


catattgtgc 


840 


catgacatca 


tgccccctcc 


tcttcaggtt 


gagaaagaaa 


ttggagagaa 


gaggttgatc 


900 
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tcttttcatg gcaccggaat atcagtggca ccggaaatta atttccaaga agttactgcc 960 
tcttgtgggt cccccgagga ggcgaaggca gcttattcac aggcactcta tgattccgtg 1020 
tgtgaacaat acaaggtgct acattctgcg gtacatggag gaaaagggtt agaagcatca 1080 
acaccaagtg tctcgttgtc acaacccttg cagtttctcg attaggatcc 113 0 

<210> 3 
<211> 1130 
<212> DNA 

<213> Carthamus tinctorius 



<400> 3 














ggatccatgc 


acggtcactc 


tcgtacattc 


atcgatgctc 


gttccgaaca 


agatcttctt 


60 


tctggaattc 


aaagagagtt 


ggaagctgga 


acactgccaa 


aacatattgc 


tcaagcaatg 


120 


gaggagctat 


atcagaacta 


caaaaatgca 


gttctccaaa 


gtgcggctcc 


tcatgcagaa 


180 


gatattgtgt 


tgtcaaacat 


gcgtgtagcg 


tttgatcgta 


tgttcttgga 


tgtgaaggag 


240 


ccgtttgaat 


tttcaccata 


tcatgaagct 


attttggaac 


cttttaacta 


ctatatgttt 


300 


ggtcaaaatt 


atattcggcc 


tttggtcaat 


ttcagggaat 


catacgttgg 


caatgtctcc 


360 


gttttcggtg 


taatggaaga 


gcagcttaag 


cagggtgaca 


aggtggtttt 


gatctcaaac 


420 


catcaaacag 


aagcagatcc 


agctgttatt 


gccttgatgc 


ttgaaacaac 


aaacccccat 


480 


atttctgaga 


acataatcta 


cgtggcaggg 


gatagagtaa 


taacagatcc 


tctttgcaag 


54 0 


cctttcagca 


tgggaaggaa 


tctgttgtgc 


gtgtattcaa 


aaaagcatat 


gaatgatgtt 


600 


cctgagcttg 


ctgagatgaa 


aaaaagatca 


aatacaagaa 


gtttaaaaga 


gatggctttg 


660 


cttttgaggg 


gcggatctaa 


aataatatgg 


attgcgccaa 


gtggtggcag 


ggacaggcca 


720 


gatcctatca 


caaatcagtg 


gtttccggca 


ccgtttgatg 


ccacttcgct 


tgacaacatg 


780 


agaaggctcg 


tggaccatgc 


tggtttggtg 


ggtcacatat 


atcctttagc 


catattgtgc 


840 


catgacatca 


tgccccctcc 


tcttcaggtt 


gagaaagaaa 


ttggagagaa 


gaggttgatc 


900 


tcttttcatg 


gcaccggaat 


atcagtggca 


ccggaaatta 


atttccaaga 


agttactgcc 


960 


tcttgtgggt 


cccccgagga 


ggcgaaggca 


gcttattcac 


aggcactcta 


tgattccgtg 


1020 


tgtgaacaat 


acaaggtgct 


acattctgcg 


gtacatggag 


gaaaagggtt 


agaagcatca 


1080 


acaccaagtg 


tctcgttgtc 


acaacccaag 


cagaaactcg 


attaggatcc 




1130 



<210> 4 
<211> 2502 
<212> DNA 

<213> Escherichia coli 
<400> 4 

agatcttccc atgactttct gctatccttg ccgcgcattt gcattattaa ccagaggctt 60 
tacatcgttt atgtccggct ggccacgaat ttactacaaa ttactgaatt taccattaag 120 
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catcctggta aaaagcaagt ctattccggc 
ctctcgtcca attatgtacg ttttaccgta 
cgcccagtgt ctggcacatg acttgcctga 
gctactgccg cgctatgtgt tcattcacgg 
gaaagaagag tctattaagc tgttccacga 
tctggatgtg cagatggtgc cagtgtcggt 
aggcgaagtg aacccgccgc tgcgtatgct 
gtggctcggt cgcgacagtt ttgtgcgttt 
ggatgaacac ggcacggata aaactatcgc 
ctttgcccgt caacgtctgg ctgccgtagg 
taataagctg ctcgcctccc gcgccattgc 
aaaaatctcc catgaaaaag cgcagcagaa 
gaatttctct tacgagatga ttcgcctgac 
actttaccag ggcatcaacg tccataacgc 
tcatgagctg gtatatgtgc cttgccaccg 
cgtgctgtat caccaggggc tggtgccgcc 
ctggcctgcc gggccgattt tccgccgtct 
aggcaataaa ctttattcca ccgttttccg 
ttattccgtc gagtacttcg tggaaggcgg 
gaaaactggt acgctgtcga tgaccattca 
tacgctgatt ccgatctata tcggttatga 
agaactgcgc ggcgcgacga aagagaaaga 
caagctgcgt aatctcggtc agggttacgt 
ctaccttaac cagcatgtac ctgactggcg 
tccggcatgg ttaacgccga cggtcaataa 
caacgcaggc gcggcaaacg ccatgaacct 
gcgctcactc acccgcgagc agttaaccga 
caacgtgccc tactccacgg actctaccgt 
tcacgcgctg caaatgaaca agtttgaagt 
tctgccgcgc gagcaagcgg tgctgatgac 
ggtgctgcct tcgctgatgg cgacaatcgt 
attgatggag cacgtcaatg tgctttaccc 
ggatcgcgac gagttgccgg acgttattga 
gctgattacc ctgcaagatg atgagttgca 
gctgctggcc gcaggcgcgc gcgaaacgct 
gagtgccaac ccgtcgatca accgcggtac 
acgtctctcc gtgctgcacg gcatcaacgc 
ttctctggtg ctgacactgc gtgatgaagg 



agatcctgcc ccggaactgg ggctggatac 180 
caactcgaaa gcagatttgc tgacgttgcg 24 0 
cccgttagag ccgctggaaa tcgacggcac 30 0 
cgggccgcgt gtgttcacct attacacgcc 360 
ctatctcgat ttgcaccgta gcaacccaaa 420 
gatgtttggt cgcgcgccgg ggcgtgaaaa 480 
taacggcgta cagaaatttt tcgctgtact 54 0 
ctcgccgtca gtttcgctgc gccgtatggc 600 
tcagaaactg gcgcgcgtgg cgcgtatgca 66 0 
cccacgtttg cctgctcgtc aggatctgtt 720 
caaagcggta gaagatgaag cgcgcagcaa 78 0 
cgcgattgca ctgatggaag agattgcggc 84 0 
tgaccgtatt ctgggcttca cctggaaccg 900 
tgagcgcgtt cgccagctgg cccacgacgg 960 
cagtcacatg gactacctgc tgctttctta 1020 
gcatatcgcc gccgggatca acctgaattt 1080 
gggggcgttc tttattcgcc gtacgtttaa 114 0 
ggagtatctc ggcgaactgt tcagccgtgg 12 00 
tcgttcccgt acggggcgtt tgctggatcc 1260 
ggcgatgctg cgtggcggca cgcgtccgat 1320 
gcacgtcatg gaagtgggta cttacgccaa 1380 
gagcctgccg cagatgctgc gcggtttaag 1440 
caacttcggt gaaccaatgc cgttgatgac 1500 
tgaatctatc gatcccatcg aagcggtgcg 1560 
tattgctgcc gatctgatgg tacgcattaa 1620 
gtgctgtact gcgctactgg catcacgtca 1680 
gcaactcaac tgctacctgg atctgatgcg 1740 
tccttcagcc agcgccagcg agcttatcga 1800 
cgagaaagac acaatcggcg aca teat cat 18§0 
ctactatcgc aacaacattg cgcatatgtt 192 0 
cacccagcat cgccacatct cccgcgacgt 1980 
aatgctgaaa gcggagctgt tcctgcgctg 2040 
tgcgctggca aatgagatgc aacgtcaggg 2100 
tatcaacccg gcgcattctc gcacgctaca 2160 
geaaegttat gccatcacct tctggttgtt 2220 
gctggagaaa gagagecgea ccgtcgcgca 22 80 
geeggagtte ttcgacaagg cggtgttcag 234 0 
gtatatcagc gatageggeg atgccgaacc 24 00 
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ggcagaaacg atgaaggttt atcagttgct ggcggagttg attacatcag acgtgcgttt 2460 
gacgattgag agtgcgacgc agggcgaagg gtaatcagat ct 2502 

<210> 5 
<211> 2502 
<212> DNA 

<213> Escherichia coli 



<400> 5 



agatcttccc 


atgactttct 


gctatccttg 


ccgcgcattt 


gcattattaa 


ccagaggctt 


60 


tacatcgttt 


atgtccggct 


ggccacgaat 


ttactacaaa 


ttactgaatt 


taccattaag 


120 


catcctggta 


aaaagcaagt 


ctattccggc 


agatcctgcc 


ccggaactgg 


ggctggatac 


180 


ctctcgtcca 


attatgtacg 


ttttaccgta 


caactcgaaa 


gcagatttgc 


tgacgttgcg 


240 


cgcccagtgt 


ctggcacatg 


acttgcctga 


cccgttagag 


ccgctggaaa 


tcgacggcac 


300 


gctactgccg 


cgctatgtgt 


tcattcacgg 


cgggccgcgt 


gtgttcacct 


attacacgcc 


360 


gaaagaagag 


tctattaagc 


tgttccacga 


ctatctcgat 


ttgcaccgta 


gcaacccaaa 


420 


tctggatgtg 


cagatggtgc 


cagtgtcggt 


gatgtttggt 


cgcgcgccgg 


ggcgtgaaaa 


480 


sggcgaagtg 


aacccgccgc 


tgcgtatgct 


taacggcgta 


cagaaatttt 


tcgctgtact 


540 


gtggctcggt 


cgcgacagtt 


ttgtgcgttt 


ctcgccgtca 


gtttcgctgc 


gccgtatggc 


600 


ggatgaacac 


ggcacggata 


aaactatcgc 


tcagaaactg 


gcgcgcgtgg 


cgcgtatgca 


660 


ctttgcccgt 


caacgtctgg 


ctgccgtagg 


cccacgtttg 


cctgctcgtc 


aggatctgtt 


720 


taataagctg 


ctcgcctccc 


gcgccattgc 


caaagcggta 


gaagatgaag 


cgcgcagcaa 


780 


aaaaatctcc 


catgaaaaag 


cgcagcagaa 


cgcgattgca 


ctgatggaag 


agattgcggc 


840 


gaatttctct 


tacgagatga 


ttcgcctgac 


tgaccgtatt 


ctgggcttca 


cctggaaccg 


900 


actttaccag 


ggcatcaacg 


tccataacgc 


tgagcgcgtt 


cgccagctgg 


cccacgacgg 


960 


tcatgagctg 


gtatatgtgc 


cttgccaccg 


cagtcacatg 


gactacctgc 


tgctttctta 


1020 


cgtgctgtat 


caccaggggc 


tggtgccgcc 


gcatatcgcc 


gccgggatca 


acctgaattt 


1080 


ctggcctgcc 


gggccgattt 


tccgccgtct 


gggggcgttc 


tttattcgcc 


gtacgtttaa 


1140 


aggcaataaa 


ctttattcca 


ccgttttccg 


ggagtatctc 


ggcgaactgt 


tcagccgtgg 


1200 


ttattccgtc 


gagtacttcg 


tggaaggcgg 


tcgttcccgt 


acggggcgtt 


tgctggatcc 


1260 


gaaaactggt 


acgctgtcga 


tgaccattca 


ggcgatgctg 


cgtggcggca 


cgcgtccgat 


1320 


tacgctgatt 


ccgatctata 


tcggttatga 


gcacgtcatg 


gaagtgggta 


cttacgccaa 


1380 


agaactgcgc 


ggcgcgacga 


aagagaaaga 


gagcctgccg 


cagatgctgc 


gcggtttaag 


1440 


caagctgcgt 


aatctcggtc 


agggttacgt 


caacttcggt 


gaaccaatgc 


cgttgatgac 


1500 


ctaccttaac 


cagcatgtac 


ctgactggcg 


tgaatctatc 


gatcccatcg 


aagcggtgcg 


1560 


tccggcatgg 


ttaacgccga 


cggtcaataa 


tattgctgcc 


gatctgatgg 


tacgcattaa 


1620 


caacgcaggc 


gcggcaaacg 


ccatgaacct 


gtgctgtact 


gcgctactgg 


catcacgtca 


1680 


gcgctcactc 


acccgcgagc 


agttaaccga 


gcaactcaac 


tgctacctgg 


atctgatgcg 


1740 


caacgtgccc 


tactccacgg 


actctaccgt 


tccttcagcc 


agcgccagcg 


agcttatcga 


1800 
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tcacgcgctg 


caaatgaaca 


agtttgaagt 


cgagaaagac 


acaatcggcg 


acatcatcat 


1860 


tctgccgcgc 


gagcaagcgg 


tgctgatgac 


ctactatcgc 


aacaacattg 


cgcatatgtt 


1920 


ggtgctgcct 


tcgctgatgg 


cggcaatcgt 


cacccagcat 


cgccacatct 


cccgcgacgt 


1980 


attgatggag 


cacgtcaatg 


tgctttaccc 


aatgctgaaa 


gcggagctgt 


tcctgcgctg 


2040 


ggatcgcgac 


gagttgccgg 


acgttattga 


tgcgctggca 


aatgagatgc 


aacgtcaggg 


2100 


gctgattacc 


ctgcaagatg 


atgagttgca 


tatcaacccg 


gcgcattctc 


gcacgctaca 


2160 


gctgctggcc 


gcaggcgcgc 


gcgaaacgct 


gcaacgttat 


gccatcacct 


tctggttgtt 


2220 


gagtgccaac 


ccgtcgatca 


accgcggtac 


gctggagaaa 


gagagccgca 


ccgtcgcgca 


2280 


acgtctctcc 


gtgctgcacg 


gcatcaacgc 


gccggagttc 


ttcgacaagg 


cggtgttcag 


2340 


ttctctggtg 


ctgacactgc 


gtgatgaagg 


gtatatcagc 


gatagcggcg 


atgccgaacc 


2400 


ggcagaaacg 


atgaaggttt 


atcagttgct 


ggcggagttg 


attacatcag 


acgtgcgttt 


2460 


gacgattgag 


agtgcgaagc 


agaaggaagg 


gtaatcagat 


ct 




2502 



<210> 6 
<211> 487 
<212> PRT 

<213> Carthamus tine tor ius 
<400> 6 

Ser Leu Ser Leu Thr His Asn Thr Gin Asn Thr His Tyr Cys Tyr Phe 
15 10 15 

Leu Ser Leu Leu His Ser Pro Leu Ala Met Ser lie Phe Phe Ser Pro 
20 25 30 

Ser Ser Pro Thr Leu Phe Phe Ser Thr Thr Asn Ala Asn Pro Arg Val 
35 40 45 

Ser Pro Ser Ser Ser Pro Ser Ser Ala Phe Thr Pro Pro Leu Ser Ser 
50 55 60 

Ser Arg Leu Arg Pro lie Leu Arg Gly Phe Pro Cys Leu Ala Phe Ser 
65 70 75 80 

Ala Pro Ala Asn Ala Ala His Gly Thr Ala Glu Thr Val His Gly Asn 
85 90 95 

Lys Trp Pro Ser Pro Ser Ser Ser Ser Ser Ala Ala Thr Gin Pro Ser 
100 105 110 
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Ala Gly Ser Asp His Gly His Ser Arg Thr Phe lie Asp Ala Arg Ser 
115 120 125 

Glu Gin Asp Leu Leu Ser Gly lie Gin Arg Glu Leu Glu Ala Gly Thr 
130 135 140 

Leu Pro Lys His lie Ala Gin Ala Met Glu Glu Leu Tyr Gin Asn Tyr 
145 150 155 160 

Lys Asn Ala Val Leu Gin Ser Ala Ala Pro His Ala Glu Asp lie Val 
165 170 175 

Leu Ser Asn Met Arg Val Ala Phe Asp Arg Met Phe Leu Asp Val Lys 
180 185 190 

Glu Pro Phe Glu Phe Ser Pro Tyr His Glu Ala lie Leu Glu Pro Phe 
195 200 205 

Asn Tyr Tyr Met Phe Gly Gin Asn Tyr lie Arg Pro Leu Val Asn Phe 
210 215 220 

Arg Glu Ser Tyr Val Gly Asn Val Ser Val Phe Gly Val Met Glu Glu 
225 230 235 240 

Gin Leu Lys Gin Gly Asp Asn Val Val Leu lie Ser Asn His Gin Thr 
245 250 255 

Glu Ala Asp Pro Ala Val lie Ala Leu Met Leu Glu Thr Thr Asn Pro 
260 265 270 

His lie Ser Glu Asn lie lie Tyr Val Ala Gly Asp Arg Val lie Thr 
275 280 285 

Asp Pro Leu Cys Lys Pro Phe Ser Met Gly Arg Asn Leu Leu Cys Val 
290 295 300 



Tyr Ser Lys Lys His Met Asn Asp Val Pro Glu Leu Ala Glu Met Lys 
305 310 315 320 



WO 00/78974 



8/21 



PCT/CA00/00738 



Lys Arg Ser Asn 

Gly Gly Ser Lys 
340 

Pro Asp Pro lie 
355 

Ser Leu Asp Asn 
370 

His lie Tyr Pro 
385 

Leu Gin Val Glu 



Gly Thr Gly lie 
420 

Ala Ser Cys Gly 
435 

Leu Tyr Asp Ser 
450 

His Gly Gly Lys 
465 

Gin Pro Leu Gin 



Thr Arg Ser Leu 
325 

lie lie Trp lie 

Thr Asn Gin Trp 
360 

Met Arg Arg Leu 
375 

Leu Ala lie Leu 
390 

Lys Glu lie Gly 
405 

Ser Val Ala Pro 

Ser Pro Glu Glu 
440 

Val Cys Glu Gin 
455 

Gly Leu Glu Ala 
470 

Phe Leu Asp 
485 



Lys Glu Met Ala 
330 

Ala Pro Ser Gly 
345 

Phe Pro Ala Pro 

Val Asp His Ala 
380 

Cys His Asp lie 
395 

Glu Lys Arg Leu 
410 

Glu lie Asn Phe 
425 

Ala Lys Ala Ala 

Tyr Lys Val Leu 
460 

Ser Thr Pro Ser 
475 



Leu Leu Leu Arg 
335 

Gly Arg Asp Arg 
350 

Phe Asp Ala Thr 
365 

Gly Leu Val Gly 

Met Pro Pro Pro 
400 

lie Ser Phe His 
415 

Gin Glu Val Thr 
430 

Tyr Ser Gin Ala 
445 

His Ser Ala Val 

Val Ser Leu Ser 
480 



<210> 7 
<211> 374 
<212> PRT 

<213> Carthamus tinctorius 
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<400> 7 

Gly Ser Met His Gly His Ser Arg Thr Phe lie Asp Ala Arg Ser Glu 
1 5 10 15 

Gin Asp Leu Leu Ser Gly lie Gin Arg Glu Leu Glu Ala Gly Thr Leu 
20 25 30 

Pro Lys His lie Ala Gin Ala Met Glu Glu Leu Tyr Gin Asn Tyr Lys 
35 40 45 

Asn Ala Val Leu Gin Ser Ala Ala Pro His Ala Glu Asp lie Val Leu 
50 55 60 

Ser Asn Met Arg Val Ala Phe Asp Arg Met Phe Leu Asp Val Lys Glu 
65 70 75 80 

Pro Phe Glu Phe Ser Pro Tyr His Glu Ala lie Leu Glu Pro Phe Asn 
85 90 95 

Tyr Tyr Met Phe Gly Gin Asn Tyr lie Arg Pro Leu Val Asn Phe Arg 
100 105 110 

Glu Ser Tyr Val Gly Asn Val Ser Val Phe Gly Val Met Glu Glu Gin 
115 120 125 

Leu Lys Gin Gly Asp Lys Val Val Leu lie Ser Asn His Gin Thr Glu 
130 135 140 

Ala Asp Pro Ala Val lie Ala Leu Met Leu Glu Thr Thr Asn Pro His 
145 150 155 160 

lie Ser Glu Asn lie lie Tyr Val Ala Gly Asp Arg Val lie Thr Asp 
165 170 175 

Pro Leu Cys Lys Pro Phe Ser Met Gly Arg Asn Leu Leu Cys Val Tyr 
180 185 190 



Ser Lys Lys His Met Asn Asp Val Pro Glu Leu Ala Glu Met Lys Lys 
195 200 205 
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Arg Ser Asn Thr 
210 

Gly Ser Lys lie 
225 

Asp Pro lie Thr 



Leu Asp Asn Met 
260 

lie Tyr Pro Leu 
275 

Gin Val Glu Lys 
290 

Thr Gly lie Ser 
305 

Ser Cys Gly Ser 

Tyr Asp Ser Val 
340 

Gly Gly Lys Gly 
355 

Pro Leu Gin Phe 
370 



<210> 8 
<211> 374 
<212> PRT 
<213> Carthamus 



Arg Ser Leu Lys 
215 

lie Trp lie Ala 
230 

Asn Gin Trp Phe 
245 

Arg Arg Leu Val 

Ala lie Leu Cys 
280 

Glu lie Gly Glu 
295 

Val Ala Pro Glu 
310 

Pro Glu Glu Ala 
325 

Cys Glu Gin Tyr 



Leu Glu Ala Ser 
360 

Leu Asp 



tinctorius 



Glu Met Ala Leu 
220 

Pro Ser Gly Gly 
235 

Pro Ala Pro Phe 
250 

Asp His Ala Gly 
265 

His Asp lie Met 

Lys Arg Leu lie 
300 

lie Asn Phe Gin 
315 

Lys Ala Ala Tyr 
330 

Lys Val Leu His 
345 

Thr Pro Ser Val 



Leu Leu Arg Gly 



Arg Asp Arg Pro 
240 

Asp Ala Thr Ser 
255 

Leu Val Gly His 
270 

Pro Pro Pro Leu 
285 

Ser Phe His Gly 



Glu Val Thr Ala 
320 

Ser Gin Ala Leu 
335 

Ser Ala Val His 
350 

Ser Leu Ser Gin 
365 
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<400> 8 

Gly Ser Met His Gly His Ser Arg Thr Phe lie Asp Ala Arg Ser Glu 
15 10 15 

Gin Asp Leu Leu Ser Gly lie Gin Arg Glu Leu Glu Ala Gly Thr Leu 
20 25 30 

Pro Lys His lie Ala Gin Ala Met Glu Glu Leu Tyr Gin Asn Tyr Lys 
35 40 45 

Asn Ala Val Leu Gin Ser Ala Ala Pro His Ala Glu Asp He Val Leu 
50 55 60 

Ser Asn Met Arg Val Ala Phe Asp Arg Met Phe Leu Asp Val Lys Glu 
65 70 75 80 

Pro Phe Glu Phe Ser Pro Tyr His Glu Ala He Leu Glu Pro Phe Asn 
85 90 95 

Tyr Tyr Met Phe Gly Gin Asn Tyr He Arg Pro Leu Val Asn Phe Arg 
100 105 110 

Glu Ser Tyr Val Gly Asn Val Ser Val Phe Gly Val Met Glu Glu Gin 
115 120 125 

Leu Lys Gin Gly Asp Lys Val Val Leu He Ser Asn His Gin Thr Glu 
130 135 140 

Ala Asp Pro Ala Val He Ala Leu Met Leu Glu Thr Thr Asn Pro His 
145 150 155 160 

He Ser Glu Asn He He Tyr Val Ala Gly Asp Arg Val He Thr Asp 
165 170 175 

Pro Leu Cys Lys Pro Phe Ser Met Gly Arg Asn Leu Leu Cys Val Tyr 
180 185 190 



Ser Lys Lys His Met Asn Asp Val Pro Glu Leu Ala Glu Met Lys Lys 
195 200 205 
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Arg Ser Asn Thr 
210 

Gly Ser Lys lie 
225 

Asp Pro He Thr 



Leu Asp Asn Met 
260 

He Tyr Pro Leu 
275 

Gin Val Glu Lys 
290 

Thr Gly lie Ser 
305 

Ser Cys Gly Ser 

Tyr Asp Ser Val 
340 

Gly Gly Lys Gly 
355 

Pro Lys Gin Lys 
370 



Arg Ser Leu Lys 
215 

He Trp lie Ala 
230 

Asn Gin Trp Phe 
245 

Arg Arg Leu Val 

Ala lie Leu Cys 
280 

Glu He Gly Glu 
295 

Val Ala Pro Glu 
310 

Pro Glu Glu Ala 
325 

Cys Glu Gin Tyr 

Leu Glu Ala Ser 
360 

Leu Asp 



Glu Met Ala Leu 
220 

Pro Ser Gly Gly 
235 

Pro Ala Pro Phe 
250 

Asp His Ala Gly 
265 

His Asp He Met 

Lys Arg Leu He 
300 

He Asn Phe Gin 
315 

Lys Ala Ala Tyr 
330 

Lys Val Leu His 
345 

Thr Pro Ser Val 



Leu Leu Arg Gly 



Arg Asp Arg Pro 
240 

Asp Ala Thr Ser 
255 

Leu Val Gly His 
270 

Pro Pro Pro Leu 
285 

Ser Phe His Gly 

Glu Val Thr Ala 
320 

Ser Gin Ala Leu 
335 

Ser Ala Val His 
350 

Ser Leu Ser Gin 
365 



<210> 9 

<211> 827 

<212> PRT 

<213> Escherichia coli 
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<400> 9 

Met Thr Phe Cys Tyr Pro Cys Arg Ala Phe Ala Leu Leu Thr Arg Gly 
1 .5 10 15 

Phe Thr Ser Phe Met Ser Gly Trp Pro Arg lie Tyr Tyr Lys Leu Leu 
20 25 30 

Asn Leu Pro Leu Ser lie Leu Val Lys Ser Lys Ser lie Pro Ala Asp 
35 40 45 

Pro Ala Pro Glu Leu Gly Leu Asp Thr Ser Arg Pro lie Met Tyr Val 
50 55 60 

Leu Pro Tyr Asn Ser Lys Ala Asp Leu Leu Thr Leu Arg Ala Gin Cys 
65 70 75 80 

Leu Ala His Asp Leu Pro Asp Pro Leu Glu Pro Leu Glu lie Asp Gly 
85 90 95 

Thr Leu Leu Pro Arg Tyr Val Phe He His Gly Gly Pro Arg Val Phe 
100 105 110 

Thr Tyr Tyr Thr Pro Lys Glu Glu Ser He Lys Leu Phe His Asp Tyr 
115 120 125 

Leu Asp Leu His Arg Ser Asn Pro Asn Leu Asp Val Gin Met Val Pro 
130 135 140 

Val Ser Val Met Phe Gly Arg Ala Pro Gly Arg Glu Lys Gly Glu Val 
145 150 155 160 

Asn Pro Pro Leu Arg Met Leu Asn Gly Val Gin Lys Phe Phe Ala Val 
165 170 175 

Leu Trp Leu Gly Arg Asp Ser Phe Val Arg Phe Ser Pro Ser Val Ser 
180 185 190 



Leu Arg Arg Met Ala Asp Glu His Gly Thr Asp Lys Thr He Ala Gin 
195 200 205 
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Lys Leu Ala Arg Val 
210 

Ala Val Gly Pro Arg 
225 

Leu Ala Ser Arg Ala 
245 

Lys Lys lie Ser His 
260 

Glu Glu lie Ala Ala 
275 

Arg lie Leu Gly Phe 
290 

His Asn Ala Glu Arg 
305 

Val Tyr Val Pro Cys 
325 

Tyr Val Leu Tyr His 
340 

He Asn Leu Asn Phe 
355 

Ala Phe Phe He Arg 
370 



Ala Arg Met His Phe Ala 
215 

Leu Pro Ala Arg Gin Asp 
230 235 

He Ala Lys Ala Val Glu 
250 

Glu Lys Ala Gin Gin Asn 
265 

Asn Phe Ser Tyr Glu Met 
280 

Thr Trp Asn Arg Leu Tyr 
295 

Val Arg Gin Leu Ala His 
310 315 

His Arg Ser His Met Asp 
330 

Gin Gly Leu Val Pro Pro 
345 

Trp Pro Ala Gly Pro He 
360 

Arg Thr Phe Lys Gly Asn 
375 



Arg Gin Arg Leu Ala 
220 

Leu Phe Asn Lys Leu 
240 

Asp Glu Ala Arg Ser 
255 

Ala He Ala Leu Met 
270 

He Arg Leu Thr Asp 
285 

Gin Gly He Asn Val 
300 

Asp Gly His Glu Leu 
320 

Tyr Leu Leu Leu Ser 
335 

His He Ala Ala Gly 
350 

Phe Arg Arg Leu Gly 
365 

Lys Leu Tyr Ser Thr 
380 



Val Phe Arg Glu 
385 

Glu Tyr Phe Val 



Tyr Leu Gly Glu 
390 

Glu Gly Gly Arg 
405 



Leu Phe Ser Arg 
395 

Ser Arg Thr Gly 
410 



Gly Tyr Ser Val 
400 

Arg Leu Leu Asp 
415 
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Pro Lys Thr Gly Thr Leu Ser Met Thr lie Gin Ala Met Leu Arg Gly 
420 425 430 

Gly Thr Arg Pro lie Thr Leu lie Pro lie Tyr lie Gly Tyr Glu His 
435 440 445 

Val Met Glu Val Gly Thr Tyr Ala Lys Glu Leu Arg Gly Ala Thr Lys 
450 455 460 

Glu Lys Glu Ser Leu Pro Gin Met Leu Arg Gly Leu Ser Lys Leu Arg 
465 470 475 480 

Asn Leu Gly Gin Gly Tyr Val Asn Phe Gly Glu Pro Met Pro Leu Met 
485 490 495 

Thr Tyr Leu Asn Gin His Val Pro Asp Trp Arg Glu Ser lie Asp Pro 
500 505 510 

lie Glu Ala Val Arg Pro Ala Trp Leu Thr Pro Thr Val Asn Asn lie 
515 520 525 

Ala Ala Asp Leu Met Val Arg lie Asn Asn Ala Gly Ala Ala Asn Ala 
530 535 540 

Met Asn Leu Cys Cys Thr Ala Leu Leu Ala Ser Arg Gin Arg Ser Leu 
545 550 555 560 

Thr Arg Glu Gin Leu Thr Glu Gin Leu Asn Cys Tyr Leu Asp Leu Met 
565 570 575 

Arg Asn Val Pro Tyr Ser Thr Asp Ser Thr Val Pro Ser Ala Ser Ala 
580 585 590 

Ser Glu Leu lie Asp His Ala Leu Gin Met Asn Lys Phe Glu Val Glu 
595 600 605 



Lys Asp Thr lie Gly Asp lie lie lie Leu Pro Arg Glu Gin Ala Val 
610 615 620 
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Leu Met Thr Tyr Tyr Arg Asn Asn lie Ala His Met Leu Val Leu Pro 
625 630 635 640 

Ser Leu Met Ala Thr lie Val Thr Gin His Arg His lie Ser Arg Asp 
645 650 655 

Val Leu Met Glu His Val Asn Val Leu Tyr Pro Met Leu Lys Ala Glu 
660 665 670 

Leu Phe Leu Arg Trp Asp Arg Asp Glu Leu Pro Asp Val lie Asp Ala 
675 680 685 

Leu Ala Asn Glu Met Gin Arg Gin Gly Leu lie Thr Leu Gin Asp Asp 
690 695 700 

Glu Leu His lie Asn Pro Ala His Ser Arg Thr Leu Gin Leu Leu Ala 
705 710 715 720 

Ala Gly Ala Arg Glu Thr Leu Gin Arg Tyr Ala He Thr Phe Trp Leu 
725 730 735 

Leu Ser Ala Asn Pro Ser He Asn Arg Gly Thr Leu Glu Lys Glu Ser 
740 745 750 

Arg Thr Val Ala Gin Arg Leu Ser Val Leu His Gly lie Asn Ala Pro 
755 760 765 

Glu Phe Phe Asp Lys Ala Val Phe Ser Ser Leu Val Leu Thr Leu Arg 
770 775 780 

Asp Glu Gly Tyr He Ser Asp Ser Gly Asp Ala Glu Pro Ala Glu Thr 
785 790 795 800 

Met Lys Val Tyr Gin Leu Leu Ala Glu Leu lie Thr Ser Asp Val Arg 
805 810 815 



Leu Thr He Glu Ser Ala Thr Gin Gly Glu Gly 
820 825 
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<210> 10 
<211> 827 
<212> PRT 

<213> Escherichia coli 
<400> 10 

Met Thr Phe Cys Tyr Pro Cys Arg Ala Phe Ala Leu Leu Thr Arg Gly 
15 10 15 

Phe Thr Ser Phe Met Ser Gly Trp Pro Arg lie Tyr Tyr Lys Leu Leu 
20 25 30 

Asn Leu Pro Leu Ser lie Leu Val Lys Ser Lys Ser lie Pro Ala Asp 
35 40 45 

Pro Ala Pro Glu Leu Gly Leu Asp Thr Ser Arg Pro lie Met Tyr Val 
50 55 60 

Leu Pro Tyr Asn Ser Lys Ala Asp Leu Leu Thr Leu Arg Ala Gin Cys 
65 70 75 80 

Leu Ala His Asp Leu Pro Asp Pro Leu Glu Pro Leu Glu lie Asp Gly 
85 90 95 

Thr Leu Leu Pro Arg Tyr Val Phe lie His Gly Gly Pro Arg Val Phe 
100 105 110 

Thr Tyr Tyr Thr Pro Lys Glu Glu Ser lie Lys Leu Phe His Asp Tyr 
115 120 125 

Leu Asp Leu His Arg Ser Asn Pro Asn Leu Asp Val Gin Met Val Pro 
130 135 140 

Val Ser Val Met Phe Gly Arg Ala Pro Gly Arg Glu Lys Gly Glu Val 
145 150 155 160 



Asn Pro Pro Leu Aorg Met Leu Asn Gly Val Gin Lys Phe Phe Ala Val 
165 170 175 
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Leu Trp Leu Gly Arg Asp Ser Phe Val Arg Phe Ser Pro Ser Val Ser 
180 185 190 

Leu Arg Arg Met Ala Asp Glu His Gly Thr Asp Lys Thr lie Ala Gin 
195 200 205 

Lys Leu Ala Arg Val Ala Arg Met His Phe Ala Arg Gin Arg Leu Ala 
210 215 220 

Ala Val Gly Pro Arg Leu Pro Ala Arg Gin Asp Leu Phe Asn Lys Leu 
225 230 235 240 

Leu Ala Ser Arg Ala lie Ala Lys Ala Val Glu Asp Glu Ala Arg Ser 
245 250 255 

Lys Lys lie Ser His Glu Lys Ala Gin Gin Asn Ala lie Ala Leu Met 
260 265 270 

Glu Glu lie Ala Ala Asn Phe Ser Tyr Glu Met lie Arg Leu Thr Asp 
275 280 285 

Arg lie Leu Gly Phe Thr Trp Asn Arg Leu Tyr Gin Gly lie Asn Val 
290 295 300 

His Asn Ala Glu Arg Val Arg Gin Leu Ala His Asp Gly His Glu Leu 
305 310 315 320 

Val Tyr Val Pro Cys His Arg Ser His Met Asp Tyr Leu Leu Leu Ser 
325 330 335 

Tyr Val Leu Tyr His Gin Gly Leu Val Pro Pro His lie Ala Ala Gly 
340 345 350 

lie Asn Leu Asn Phe Trp Pro Ala Gly Pro lie Phe Arg Arg Leu Gly 
355 360 365 



Ala Phe Phe He Arg Arg Thr Phe Lys Gly Asn Lys Leu Tyr Ser Thr 
370 375 380 
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Val Phe Arg Glu Tyr Leu Gly Glu Leu Phe Ser Arg Gly Tyr Ser Val 
385 390 395 400 

Glu Tyr Phe Val Glu Gly Gly Arg Ser Arg Thr Gly Arg Leu Leu Asp 
405 410 415 

Pro Lys Thr Gly Thr Leu Ser Met Thr lie Gin Ala Met Leu Arg Gly 
420 425 430 

Gly Thr Arg Pro lie Thr Leu lie Pro lie Tyr lie Gly Tyr Glu His 
435 440 445 

Val Met Glu Val Gly Thr Tyr Ala Lys Glu Leu Arg Gly Ala Thr Lys 
450 455 460 

Glu Lys Glu Ser Leu Pro Gin Met Leu Arg Gly Leu Ser Lys Leu Arg 
465 470 475 480 

Asn Leu Gly Gin Gly Tyr Val Asn Phe Gly Glu Pro Met Pro Leu Met 
485 490 495 

Thr Tyr Leu Asn Gin His Val Pro Asp Trp Arg Glu Ser lie Asp Pro 
500 505 510 

lie Glu Ala Val Arg Pro Ala Trp Leu Thr Pro Thr Val Asn Asn lie 
515 520 525 

Ala Ala Asp Leu Met Val Arg He Asn Asn Ala Gly Ala Ala Asn Ala 
530 535 540 

Met Asn Leu Cys Cys Thr Ala Leu Leu Ala Ser Arg Gin Arg Ser Leu 
545 550 555 560 

Thr Arg Glu Gin Leu Thr Glu Gin Leu Asn Cys Tyr Leu Asp Leu Met 
565 570 575 



Arg Asn Val Pro Tyr Ser Thr Asp Ser Thr Val Pro Ser Ala Ser Ala 
580 585 590 
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Ser Glu Leu lie Asp His Ala Leu Gin Met Asn Lys Phe Glu Val Glu 
595 600 605 

Lys Asp Thr lie Gly Asp lie lie lie Leu Pro Arg Glu Gin Ala Val 
610 615 620 

Leu Met Thr Tyr Tyr Arg Asn Asn lie Ala His Met Leu Val Leu Pro 
625 630 635 640 

Ser Leu Met Ala Ala lie Val Thr Gin His Arg His lie Ser Arg Asp 
645 650 655 

Val Leu Met Glu His Val Asn Val Leu Tyr Pro Met Leu Lys Ala Glu 
660 665 670 

Leu Phe Leu Arg Trp Asp Arg Asp Glu Leu Pro Asp Val lie Asp Ala 
675 680 685 

Leu Ala Asn Glu Met Gin Arg Gin Gly Leu lie Thr Leu Gin Asp Asp 
690 695 700 

Glu Leu His lie Asn Pro Ala His Ser Arg Thr Leu Gin Leu Leu Ala 
705 710 715 720 

Ala Gly Ala Arg Glu Thr Leu Gin Arg Tyr Ala lie Thr Phe Trp Leu 
725 730 735 

Leu Ser Ala Asn Pro Ser lie Asn Arg Gly Thr Leu Glu Lys Glu Ser 
740 745 750 

Arg Thr Val Ala Gin Arg Leu Ser Val Leu His Gly lie Asn Ala Pro 
755 760 765 

Glu Phe Phe Asp Lys Ala Val Phe Ser Ser Leu Val Leu Thr Leu Arg 
770 775 780 



Asp Glu Gly Tyr lie Ser Asp Ser Gly Asp Ala Glu Pro Ala Glu Thr 
785 790 795 800 
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Met Lys Val Tyr 



Leu Thr lie Glu 
820 



Gin Leu Leu Ala 
805 

Ser Ala Lys Gin 



Glu Leu lie Thr 
810 

Lys Glu Gly 
825 



Ser Asp Val Arg 
815 



